REMARKS 

I. Status of the Application 

Claims 1, 2, 4-9 and 1 1-23 are presently pending in the application. Claims 19-23 stand 
withdrawn from consideration as being directed to non-elected subject matter. Claims 11-13, 15- 
19, 22 and 23 have been cancelled without prejudice to the filing of any appropriate continuation 
applications as being directed to non-elected subject matter. Claims 1, 2, 4-9 and 14 stand 
rejected under 35 U.S.C. § 1 12, first paragraph, as failing to comply with the written description 
requirement. Claim 14 stands rejected under 35 U.S.C. § 112, first paragraph, as lacking 
enablement. 

Applicants have amended the claims under consideration to more clearly define and 
distinctly characterize Applicants' novel invention. Claim 1 was amended to recite three Ets 
binding sites and one Sp-1 binding site, support for which can be found in the published 
application at least at Figure 1, where Applicants teach the presence of three Ets binding sites 
and one Sp-1 binding site in the -778 to -422 region of the EGP-2 promoter sequence; and to 
replace reference to Figure 1 with reference to the corresponding region of SEQ ID NO:5. Claim 
14 was amended to recite a suicide gene, support for which can be found at least at paragraph 
[0019] of the published application, and to address formal matters. Claim 2 was amended to 
correct a typographical error and to address formal matters. Claim 20 was amended to depend 
from claim 5 and to address formal matters. Claim 21 was amended to recite "nucleic acid" and 
delete "method" in view of the amendment of claim 20 and to address formal matters. Claims 4- 
9 and 24 were amended to address formal matters. 

Applicants respectfully submit that the amendments presented herein do not raise new 
issues requiring further search. Applicants respectfully request entry and consideration of the 
foregoing amendments, which are intended to place the case in condition for allowance, or at 
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least in better condition for appeal. 

II. Formal Matters 

The Office Action, at page 2, asserts that claims 1, 2, 4-9 and 11-23 are pending in the 
application. However, Applicants respectfully submit that claims 1, 2, 4-9 and 11-24 are 
pending in the instant application. Claim 24 was added in Applicants' Supplemental 
Amendment mailed on May 9, 2006, a copy of which is provided herewith. 

III. Restriction 

The Office Action, at page 2, asserts that newly submitted claims 19-23 are directed to an 
invention that is independent or distinct from the invention originally claimed because they are 
drawn to a method of manufacture of a fusion construct, whereas the elected invention is an 
isolated nucleic acid. The Office Action states that the nucleic acid of Group I is patentably 
distinct from the claimed method of claims 19-23 because the nucleic acid of Group I can also be 
made by chemical synthesis, and accordingly, there would have been a search burden to examine 
these claims in a single application. Applicants respectfully traverse this restriction based on the 
amended claims now presented. 

Without acquiescing to the instant restriction, Applicants respectfully submit that claims 
19, 22 and 23 have been cancelled without prejudice to the filing of any appropriate continuation 
applications as being directed to non-elected subject matter. Claim 20 was amended to recite "a 
nucleic acid according to claim 5" and to delete reference to the method of cancelled claim 19. 
Claim 21 was amended to recite a "nucleic acid" and to delete "method." Thus, claims 20 and 
21 ultimately depend from claim 1, and, therefore, pertain to elected subject matter. 
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Accordingly, Applicants respectfully request that the restriction of claims 20 and 21 be 
withdrawn. 

IV. The Specification Provides Adequate Written Description for the Pending Claims 

At page 3 of the instant Office Action, claims 1, 2, 4-9 and 14 stand rejected under 35 
U.S.C. § 112, first paragraph, as failing to comply with the written description requirement. 
Applicants respectfully traverse this rejection in view of the amended claims now presented. 
Applicants respectfully submit that the specification provides adequate written of the claimed 
nucleic acid sequence, and that one of skill in the art would readily understand the structure of 
the claimed nucleic acid sequence based on Applicants' teachings. 

Without acquiescing to the rejection, Applicants respectfully submit that claim 1 has been 
amended to include the structural feature that at least three Ets binding sites and at least one Sp-1 
binding site are present in the claimed promoter region. This claimed structural feature 
correlates with the claimed function of allowing expression of a nucleic acid of interest operably 
linked to the promoter in a cancer cell in an epithelium-selective manner. Applicants' novel 
invention is based in part on Applicants' elucidation of the basic minimal promoter region of the 
EGP-2 promoter that can bind the RNA pol II complex. Applicants have discovered that 
constructs containing the claimed nucleic acid sequence having the claimed binding sites are 
capable of mediating epithelial-specific expression (published application, paragraphs [0068] and 
[0076]). 

The Office Action, at page 4, states that the claimed invention as a whole may not be 
adequately described where an invention is described solely in terms of a method of its making 
coupled with its function and there is no described or art-recognized correlation between the 
structure of the invention and its function. The U.S. Patent and Trademark Office's Written 
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Description Guidelines state: 

The written description requirement for a claimed genus may be 
satisfied through sufficient description of a representative number 
of species by . . . disclosure of relevant, identifying characteristics, 
i.e., structure or other physical and/or chemical properties, by 
functional characteristics coupled with a known or disclosed 
correlation between function and structure, or by a combination 
of such identifying characteristics, sufficient to show the applicant 
was in possession of the claimed genus 

66 Fed. Reg. 1099, 1106 (January 5, 2001), internal reference omitted, approved in Enzo 
Biochem, Inc. v. Gen-Probe Incorporated, 296 F.3d 1316, 1325, 63 U.S.P.Q.2d (BNA) 1609, 
1613 (Fed. Cir. 2002) (emphasis added). 

Applicants' claimed structural feature of having at least three Ets binding sites and at 
least one Sp-1 binding site in the claimed promoter region correlates with the claimed function 
of allowing expression of a nucleic acid of interest operably linked to the promoter in a cancer 
cell in an epithelium-selective manner. These binding sites were well known in the art at the 
time of filing to have the function of regulating transcriptional activation in an epithelial cell- 
specific manner. In fact, Applicants' specification discloses that the Ets binding sites were 
known to modulate epithelium-specific transcription, and that that a combination of an Sp-1 
binding site and an Ets binding site in close proximity was known in the art at the time of filing 
to regulate epithelium-specific gene expression {See published application, paragraph [0066]). 
Applicants have experimentally demonstrated that the claimed nucleic acid sequence could 
mediate expression of a nucleic acid of interest in an epithelial-specific manner (paragraph 
[0068]). Further, Applicants provide herewith Attachments A-D, review articles dated prior to 
Applicants' priority date, that describe Ets and Sp-1 transcription factors and their DNA binding 
regions (Berg (1992) Proc. Natl. Acad. Set U.S.A. 89:11109; Macleod et al. (1992) Trends 
Biochem. Sci. 17:251; Lania et al. (1997) Int. J. Biochem. Cell Biol. 29:1313; Suske (1999) Gene 
238:291). 
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Applicants submit that at the time of filing, one of skill in the art of transcriptional 
regulation would readily understand that tissue-specific expression is not conferred by stretches 
of hundreds of nucleotides per se, but rather by the presence of combinations of short, cw-acting 
sequences capable of binding transcription factors. The skilled person would appreciate, based 
on the teachings of Applicants' specification, that not each and every individual nucleotide of the 
claimed nucleic acid sequence would be responsible for mediating transcriptional regulation. 
Instead, one of skill in the art would recognize that regulatory elements, such as the claimed Ets 
and Sp-1 binding sites identified within this region, would be critical for epithelial-specific 
expression. 

Applicants' specification adequately describes the claimed nucleic acid sequence by 
providing SEQ ID NO: 5 and describing several specific Ets binding site and Sp-1 binding site 
sequences (Figure 1). The specification provides protocols that may be used by one of skill in 
the art to determine whether a claimed nucleic acid sequence functions to mediate expression of 
a nucleic acid of interest in an epithelial-specific manner. The specification teaches how to 
construct the claimed nucleic acid sequence (paragraphs [0050]-[0053]), and how to express 
these constructs in a variety of cells (paragraphs [0042]-[0045]) and in transgenic mice 
(paragraphs [0054]-[0057] and [0070]). The specification describes an assay in which epithelial- 
specific expression may be detected (paragraph [0068]). Thus, one of skill in the art would 
understand that the specification sufficiently describes the claimed nucleic acid sequence. 

The specification must be considered as a whole when determining whether the written 
description requirement is met. In re Wright, 866 F.2d 422, 425, 9 U.S.P.Q.2d (BNA) 1649, 
1651 (Fed. Cir. 1989). The knowledge of one skilled in the art also must be considered, because 
the specification must "indicate[s] to persons skilled in the art that as of the [filing] date the 
applicant had invented what is now claimed." All Dental Prodx LLC v. Advantage Dental 
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Products Inc., 309 F.3d 774, 779, 64 U.S.P.Q.2d (BNA) 1945, 1948 (Fed. Cir. 2002). When 
read as a whole, taking into account the knowledge of persons skilled in the art at the filing date 
of the instant application, this specification indicates to those skilled in the art that Applicants 
had possession of the claimed subject matter at the time of filing. Accordingly, the Examiner is 
respectfully requested to reconsider and withdraw this rejection of claims 1, 2, 4-9 and 14 under 
35 U.S.C. § 1 12, first paragraph, as failing to comply with the written description requirement. 

V. Claim 14 is Enabled 

At page 5 of the instant Office Action, claim 14 stands rejected under 35 U.S.C. § 112, 
first paragraph, as failing to comply with the enablement requirement. Applicants respectfully 
traverse this rejection based on the amended claim now presented. 

35 U.S.C. § 112, first paragraph requires that the specification must enable a person 

skilled in the art to make and use the claimed invention. However, a specification need not, and 

should not, disclose what is well known in the art. The invention that one skilled in the art must 

be enabled to make and use is that defined by the claims of the particular application. The issue 

of adequate enablement depends on whether one skilled in the art could practice the claimed 

invention without undue experimentation. Enablement is not precluded by the necessity of some 

experimentation such as routine screening, even if it is extensive routine screening. Also, the 

fact that experimentation may be complex does not necessarily make it undue, if the art typically 

engages in such experimentation (MPEP 2164.01) if the level of skill in the art is high or if all of 

the methods needed to practice the claimed invention are well known. In re Wands, 8 U.S.P.Q. 

2d 1400, 1406 (Fed. Cir. 1988). 

The determination of what constitutes undue experimentation in a given case 
requires the application of a standard of reasonableness, having due regard for the 
nature of the invention and the state of the art. (Citations omitted). The test is not 
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merely quantitative, since a considerable amount of experimentation is 
permissible, if it is merely routine, or if the specification in question provides a 
reasonable amount of guidance with respect to the direction in which the 
experimentation should proceed. In re Wands, 8 U.S.P.Q. 2d at 1404. 

Without acquiescing to the rejection, Applicants respectfully submit that claim 14 was 
amended include the distinguishing feature that the medicament for treating epithelial cancer 
includes an isolated or recombinant nucleic acid sequence according to claim 1 having a nucleic 
acid sequence of interest that is a suicide gene. Applicants respectfully submit that the instant 
specification provides ample direction and guidance to make and use the claimed medicament. 

Determining whether a nucleic acid sequence has the claimed sequence identity, the 
claimed binding sites, and is expressed in an epithelium-specific manner would involve only 
routine screening. For at least the reasons set forth above, Applicants respectfully submit that 
the instant specification teaches the nucleic acid sequence of SEQ ID NO: 5, the sequences of Ets 
and Sp-1 binding sites, that these binding sites could modulate epithelial-specific gene 
expression, and that sequences of many such binding sites were known in the art at the time of 
filing. Applicants teach the use of suicide genes and their prodrugs, such as thymidine 
kinase/gancyclovir and cytosine deaminase/fluorocytosine, and that suicide genes were known in 
the art at the time of filing (paragraphs [0007] and [0019]). Accordingly, based on these 
teachings, one of skill in the art could easily make and use the claimed medicament. 

Further, Applicants submit that using a combination of gene therapy and suicide genes to 
treat cancer was known in the art at the time of filing. Merely two months after Applicants' 
priority date of March 1, 2000, Springer and Niculescu-Duvaz published a review of the state of 
the art of suicide gene therapy (Attachment E, J. Clin. Invest. (May 2000) 105:1 161, Applicants' 
priority date). Attachment E teaches that a gene from a foreign enzyme can be delivered and 
targeted to a tumor where it is to be expressed, and that an administered prodrug can be 
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selectively activated by the foreign enzyme expressed in the tumor (page 1 161, third paragraph). 
This review states that the use of transgenes to encode enzymes that activate specific prodrugs to 
create toxic products ("GDEPT") i.e., "suicide genes", "have already shown efficacy in vivo" 
(page 1161, second paragraph; page 1166, third full paragraph). In fact, gene therapy using 
suicide genes was being utilized in more than 25 clinical trials at the time this review was 
published (Table 1, page 1 162). 

The Office Action, at page 5, states that immune response is an obstacle of gene therapy 
and indicates that the claims are not limiting the medicament with a DNA-based vector or 
therapeutic genes that would not elicit an immune response. The Springer and Niculescu-Duvaz 
review, however, does not report host immune system rejection of suicide transgenes. Instead, 
this reference indicates that the immune response increases the efficacy of gene therapy 
(paragraph bridging pages 1 165-66 and first full paragraph of page 1 166). 

For at least these reasons, Applicant's specification, coupled with the level of skill in the 
art, enables a person of skill in the art to make and/or use the claimed invention. Accordingly, 
the Examiner is respectfully requested to reconsider and withdraw the rejection of claim 14 
under 35 U.S.C. § 1 12, first paragraph, as lacking enablement. 
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VI. CONCLUSION 

Having addressed all outstanding issues, Applicants respectfully request reconsideration 
and allowance of the case. To the extent the Examiner believes that it would facilitate allowance 
of the case, the Examiner is requested to telephone Applicants' attorney at the number below. 

Respectfully submitted, 

Datedi Sj^fe^ £ A-C&Z^ ^ VpU^/fc^^ 

I * JMith L. Stone-Hulslander, Reg. No. 55,652 

BANNER & WITCOFF, LTD. 
28 State Street, 28th Floor 
Boston, MA 02109 
(617) 720-9600 
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Commentary 

Sp1 and the subfamily of zinc finger proteins with guanine-rich 
binding sites 

Jeremy M. Berg 

Department of Biophysics and Biophysical Chemistry. The Johns Hopkins University School of Medicine, Baltimore, MD 21205 

The Cys2His2 zinc finger proteins are a bound to a cognate oligonucleotide have decreased affinity of the Spl peptide for 
class of DNA binding proteins that con- provided the most detailed structural in- two mutant sites reported by Kriwacki et 
tain sequences of the form (Tyr,Phe)- formation to date concerning zinc finger al. (21). Finally, this analysis provides a 
Xaa-Cys-Xaa 2 ,4-Cys-Xaa 3 -Phe-Xaa5- protein-DNA recognition (12). Three rationalization for the asymmetry of the 
Leu-Xaa2-His-Xaa 3 _5-His, usually in tan- amino acids per domain play the most contributions to overall binding affinity 
dem arrays (1-4). Each of these direct role in determining site preference, noted by Kriwacki et al. since the first 
sequences binds a zinc(II) ion to form a These are indicated as Xaa A , Xaa B , and domain (which contacts the 3' end of the 
structural domain termed a zinc finger. Xaa c in the sequence below: binding site) has only Xaa B = His pre- 

Structural studies by NMR (5-11) and rrvr Phe)-Xaa-Cvs-Xaa, .-Cvs-Xaa,- dicted t0 make a du * ectl V Precedented 
x-ray crystallography (12) have revealed 1 1 yr ' ne} Aaa uys Aaa2 4 ^ ys Aaaj contact. While it is likely that Xaa A = Lys 
that these domains adopt very similar Phe-Xaa-Xaa A -Xaa-Xaa-Xaa B - also hydrogen bonds to a base, Xaa c = 
structures that consist of a B hairpin . _.. v __ „ c „. v __ „. Ala is obviously incapable of such an 

followed by a helix. Yet, because of ^u-Xaa-XaaC-H,s-Xaa 3 _5-H.s. interaction . 

variations of certain key amino acids The first and third fingers of Zif 268 have This observation raises one of the re- 
frorn one zinc finger to the next, each Xaa A = Arg, Xaa" = Glu, Xaa c = Arg. maining puzzles about Spl-DNA inter- 
domain makes its own unique contribu- Each of these binds the triplet GCG with actions. If Xaa c = Ala does not interact 
tion to DNA binding affinity and sped- the two Arg residues forming a pair of directly with the DNA, why is the under- 
ficity. hydrogen bonds with guanine and the Glu lined guanine in 5'-GGG-GCG-QGG so 

The most well-understood members of interacting indirectly with cytosine. Note well conserved in the known Spl binding 
this class of proteins are a subset that that the Arg in Xaa* interacts with the sites? This position is a thymine in sev- 
bind to relatively guanine-rich binding guanine at the 3' end of the triplet. The era! of the known high-affinity sites but 
sites. This subset includes Spl (13), the second domain of Zif268 has Xaa A = no direct studies have been reported that 
Zif268/NGFI-A/Krox-20,24/Egrl ,2/ Arg, Xaa B = His, Xaa c = Thr. This finger bear on the effects of mutations at this 
Wilm tumor family (14-19) and yeast interacts with the sequence 5'-TGG-3' in position on Spl binding affinity. It is 
ADR1 (20). Kriwacki et al. (21) report the cocrystal structure via two hydrogen possible that changes in this site do not 
studies of the DNA binding domain of bonds from the Arg to the 3' guanine and affect affinity significantly or that gua- 
Spl that consists of three zinc finger one hydrogen bond between the His and nine is preferred but is recognized by a 
domains. They demonstrated three major N7 of the central guanine. more indirect mechanism, 

points, (i) They showed that a 92-amino The three zinc finger domains (shown, These studies illustrate one approach 
acid peptide corresponding to the zinc in order, fingers 1-3) of Spl have poten- to investigating specific protein-DNA in- 
finger region of Spl with very little flank- tial contact residues teractions— namely, examination of the 
ing sequence specifically bound DNA , . e _ ability of a given protein to distinguish 

containing sites that approximated the ' ' - ms,*a» - »ia between different binding sites. An alter- 

Spl binding site 5'-GGGGCGGGGC-3'. Xaa* = Arg,iaa» = Glo,iaac = Arg native approach is to investigate the ef- 
(/7) They demonstrated the ability of this XaaA _ Iaa , _ glg IaaC _ fects of changes in the protein sequence 

peptide to distinguish between different ' on DNA binding affinity and specificity. 

DNA sequences in a manner reminiscent Assuming that the corresponding con- Change-of-specificity mutants (in which 
of that of intact Spl. (/«') They showed tacts from the Zif 268 structure apply to DNA binding ability is not simply dis- 
that the 5'-GGGGCG region of the bind- Spl, the binding site can be predicted to rupted but changed to a different pre- 
ing site contributed more to the overall be 5'-NGG-GCG-NGN-3', where the 5'- ferred DNA sequence) are particularly 
binding affinity than does the GGGC-3' NGG triplet is contacted by the third useful. The first such mutants were re- 
region, finger, the central GCG is contacted by ported by Nardelli et al. (29) for Krox-20, 
These observations can be interpreted the second finger, and the NGN triplet is a close homologue of Zif 268, with three 
in light of other reports. Several groups contacted by the first finger. The His zinc finger domains and the same contact 
have demonstrated that fragments of Spl residues in position Xaa" are likely to residues. The wild-type protein was 
that included the zinc finger domains bind well to adenine as well as guanine shown to bind 5'-GCG-GGG-GCG-3' but 
showed DNA binding properties very since the observed hydrogen bond in- not5'-GCG-GCG-GCG-3'. In the second 
similar to those of the intact protein (22- volved N7. This has been directly dem- domain, Xaa" was changed from His to 
26). The smaller fragment studied by Kri- onstrated for an Spl variant with Xaa" = Glu and Xaa c was changed from Thr to 
wacki et al. (21) is the smallest polypep- His in the second domain (27). Thus, the Arg. The double mutant showed the re- 
tide studied to date beginning only 7 predicted site can be written 5'-N(G/ verse specificity for the two DNA se- 
residues before the first Cys residue and A)G-GCG-N(G/A)N-3'. This clearly is a quences above. Obviously, the two mu- 
;xtending only 3 amino acids past the last good approximation to the Spl consen- tations put the same contact residues in 
His residue. sus (23, 28). Moreover, it accounts for the the second domain that were already 
The crystallographic studies of the many aspects of the variations in the found in the first and third domains. 
:hree zinc finger domains of Zif 268 known Spl binding sites as well as the Thus, binding to 5'-GCG-GCG-GCG-3' is 
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not surprising but it is important to real- 
ize that these experiments were pub- 
lished prior to the report of the Zif 268 
cocrystal structure. Other mutants stud- 
ied resulted in reduction in the selectivity 
of DNA binding. 

A more extensive series of change-of- 
specificity mutants of Spl itself has been 
developed by Desjarlais and Berg (25- 
27). These involved changes in the sec- 
ond zinc finger domain. The first of these 
changed the binding specificity from 5'- 
GGG-GCG-GGG-3' to 5'-GGG-GCI- 
GGG-3' via mutation ofXaa A from Arg to 
Gin and Xaa B from Glu to Asp as well as 
one additional change in a residue that 
interacts with the Xaa A side chain (25, 
26). The fact that three amino acid se- 
quence changes were required to change 
the preferred binding site in a single po- 
sition illustrates the necessity to consider 
the entire recognition helix; only certain 
sets of contact residues are mutually 
compatible. An additional mutant was 
produced by changing Xaa 8 from Asp to 
Asn in the above context. This mutant 
binds to 5'-GGG-GAl-GGG-3'. A single 
change of Xaa B from Glu to His in the 
wild-type Spl background bound to 5'- 
GGG-G( A/G)G-GGG-3 ' . This is the in- 
verse of one of the Krox-20 
discussed above. Finally, a mutant with 
Xaa* changed from Glu to Leu bound 
5'-GGG-G(A/C/T)G-GGG-3' . 

A final member of this protein subfam- 
ily that has been extensively studied is 
yeast ADR1. This protein has two zinc 
fingers in its DNA binding domain and it 
generally binds as a dimer on sites with 
approximate dyad symmetry (30). The 
two putative contact residues (shown, in 
order, fingers 1 and 2) are 

Xaa* = Arg,Xaa» = Bis, Xaa' = Arg 

Xaa* = arg, Xaa 1 = Leu,Xaa c = Arg. 

One of the preferred binding sites is 5'- 
TTG-GAG-3'. These contacts are con- 
sistent with those above except that the 
Arg in position Xaa c in the second finger 
abuts thymine rather than guanine. Three 
change-of-specificity mutants were pro- 
duced via an extensive study of muta- 
tions in the first zinc finger (31). Changing 
Xaa A from Arg to Gin produced a protein 
that preferred the site 5'-TTG-GAA-3'. It 
is interesting to note that the preference 
associated with the Xaa A = Gin differs 
from that observed in the Spl mutant 
above. Glutamine is notable for its ability 
to interact with DNA bases in different 
ways; in the cocrystal structure of 434 



repressor bound to DNA, three Gin res- 
idues make three different types of con- 
tact with DNA (32). A second mutant 
involved changing Xaa B from His to Thr. 
This mutant protein preferred 5'-TTG- 
GC.G-3'. Finally, a mutant involving a 
change of Xaa c from Arg to Asn pre- 
ferred 5'-TTG-AAG-3'. These results on 
ADR1 confirm and extend the results 
from the Spl mutants. 

Because of the apparent modular na- 
ture of the zinc finger domains in proteins 
of this class, it has been very tempting to 
hope that codes could be deduced that 
interrelate the amino acid sequences of 
zinc finger domains and the sequences of 
their preferred binding sites. This would 
allow prediction of binding site se- 
quences for naturally occurring zinc fin- 
ger proteins and design of zinc finger 
proteins with preselected DNA binding 
properties. As the above discussion indi- 
cates, considerable progress toward this 
goal has been made for this small subset 
of zinc finger proteins that interact with 
guanine-rich binding sites. However, it is 
important to note that many zinc finger 
proteins have amino acids in the potential 
contact positions whose interactions with 
DNA have not been elucidated. Further- 
: finger proteins such as 
Drosophila hunchback have been shown 
to interact with very (A+T>rich binding 
sites (33, 34). Future studies should re- 
veal whether similar rules can be devel- 
oped for these protein-DNA interac- 
tions. 
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REVEWS 



The ets 2ene family 



THE \-ets ONCOGENE, the first member 
of the eft gene family to be described, 
was discovered in the E26 ('E-fwenty- 
six') acutely transforming retrovirus of 
chicken, from which it derives its 
name 12 . Other family members cloned 
from a variety of organisms 1 " 19 , are listed 
in Table I. In most family members, the 
conserved DNA-binding domain (the 
ETS domain), is localized at the car- 
boxyl terminus of the protein, with the 
exception of Elk-1, SAP-1 and Elf-1, 
where it is found at the amino termi- 
nus 91016 ' 17 . The ETS domain, which cov- 
ers approximately 85 amino acids (Fig. 
1), has no structural homology to other 
known DNA-binding motifs such as the 
zinc finger, homeodomain, leucine zip- 
per or helix-turn-helix motifs 6 but has 
limited homology to the DNA-binding 
domain of the c-myb oncogene product. 
In c-Myb, nine Trp residues are repeated 
every 18 or 19 amino acids around a 
predicted a-helical structure 6 , whereas 
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Ets proteins have a conserved DNA-binding domain and regulate transcrip- 
tional initiation from a variety of cellular and viral gene promoter and 
enhancer elements. Some members of the Ets family, Ets-1 and Ets-2, 
cooperate in transcription with the AP-1 transcription factor, the product of 
the proto-oncogene families, fos and jun, while others, Elk-1 and SAP-1, 
form ternary complexes with the serum response factor (SRF). Certain ets 
gene family members possess transforming activity while others are acti- 
vated by proviral integration in erythroleukaemias. 



the ETS domain contains only two or 
three conserved Trp residues (Fig. I) 6 . 

The sequence conservation of the 
proteins and the chromosome locations 
of the genes provide clues to gene 
duplication and divergence events that 
gave rise to this gene family. For ex- 
ample, ets-1 and fli-1 both map to the 
same region of human chromosome 
1 lq23, while ets-2 and erg both map to 
21q22 (Refs 2,11). However, the hom- 
ology between ets-1 and ets-2 and 
between erg and fli-1 is greater than the 



homology between those genes that 
map next to each other on the same 
chromosome 211 . This suggests that du- 
plication of an ancestral eft-related 
gene may have occurred, giving rise to 
two sister genes, which subsequently 
diverged over time. This was followed 
by duplication of the entire locus to 
another chromosome. 

Regulation of ets expression and activity 

The expression of eft genes is con- 
trolled at several levels: the initiation of 



Ets-1 GSGP-IQI W 2FLLELLTDKS-CQSFI£ W r-GDGWEFKLSDPDEVA- -RI W 3KRK-NKPKMNYEKLSRGLRYYYDKNIIHKTAGKRYVYRFV 

Ets-2 GSGP-IQI W 2FLLELLSDKS-CQSFIS W T-GDGWEFKLADPDEVA--RI W 3KRK-NKPKMNYEKLSRGLRYYYDKNIIHKTSGKRYVYRFV 

GABP- a NNGQ-IQI W 2FLLELLTDKD-ARDCIJ W /GDEEGEFKLWQPELVA- -Qi W 3QRK-NKPTMNYEKLSRALRYYYDGDMICKVQGKRFVYKFV 

Erg GSGQ-IQI W 2FLLELLSDSS-NSSCn W S-GTNGEFKMTDPDEVA- -RE W 3ERK-SKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFD 

Fli-1 GSGQ-IQI W 2FLLELLSDSA-NASCn W 2-GTNGEFKMTDPDEVA--RI W 3ERK-SKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFD 

D-Elg NNGQ-VQI W 2FLLEILTDCE-HTDVII W f-GTEGEFKLTDPDRVA- -RI W 3EKK-NKPAMNYEKLSRALRYYYDGDMISKVSGKRFAYKFD 

E74A GST--TYI W 3FLLKLLQDREYCPRFII W r/NREKGVFKLVDSKAVS- -RI W 3MHK-NKPDMNYETMGRALRYYYQRGILAKVDGQRLVYQFV 

Elf-1 GNT--IYI W SFLLALLQDKATCPKY 1 1 W r/QREKG I FKLVDSKAVS - - RI W 3KHK - NKPDMNYETMGRALRY Y YQRG I L AKVEGGRLVYQF - 

SAP-1 -MDSAITI N 2FLLQLLQKPQ-NKHMIC W FS-MDGQFKLLQAEEVA- -RI W 3IRK-NKPNMNYDKLSRALRYYYVKNIIKKVNGQKFVYKFV 

Elk-1 PS VTI W 2FLLQLLREQG-NGHIIS W CSADGGEFKLVDAEEVA- -AI W 3LAK-NKTNMNYDKLSRALAYYYDKNIIIAKVSGQKFVYKV 

PEA 3 QRRGALQI W 2FLVALLDDPT -NAHFIi W TG-RGMEFKLIEPEEVA— RI W 3IQK-NRPAMNYDKLSRSLRYYYEKGIMQKVAGERYVYKFV 

PU-1 GSKKKIRLYQFLLDLLRSGD-MKDSIV W /DKDKGT FQF S S KH K EALAHI W 3IQKGNRKKMTYQKMARALRNYG-KTGEVKKVKKKLTYQFS 



Alignment of the ETS domain primary amino acid sequences. The conserved Trp residues (W)'are boxed. Other highly conserved amino acids 
are indicated by asterisks (*), while conserved basic residues are indicated by (B). 
© 1992, Elsevier Science Publishers, (UK) 0376-5067/92/$05.00 251 



transcription 22 , alternative splicing 45 , 
post-translational modification 7 and pro- 
tein stability 23 . The expression patterns 
of different members of the ets gene 
family vary between tissues (Table I). 

In T cells, expression of ets-1 is highest 
in quiescent cells and decreases follow- 
ing T-cell activation 20 . Conversely, ets-2 
gene expression increases from a low 
level following T-cell activation and 
reaches maximum levels 1-2 h after 
induction 20 . The mechanism of coordi- 
nation of the expression of these two 
genes during T-cell differentiation is 
unclear. 



Regulation of transcriptional initiation 
is best understood for the human ets-2 
gene. Like all other ets promoters 
described, and many other oncogene 
promoters, the ets-2 promoter lacks 
the classical TATA and CCAAT box se- 
quences found in many RNA pol- 
ymerase H-transcribed genes. A puta- 
tive Ets-binding site (EBS) has been 
identified downstream of the promoter 
(Table 10 and ets-2 may be negatively 
autoregulated. Adjacent to this EBS, 
there is a putative activator protein 1 
(AP-1) site and factors bound at these 
sites cooperate in /rans-activation. 



These EBS and AP-1 sites bind activator 
and repressor factors in a cell-specific 
fashion 22 . 

The promoter of the human ets-1 
gene contains putative AP-l, EBS, AP-2 
and Spl sites. Expression of exogenous 
AP-2 and Ets-1 increases the* level of 
transcription from the ets-1 promoter, 
implying an activating role for AP-2 in 
ets-1 expression and positive autoregu- 
lation of ets-1 transcription 3 24 . 

Chicken and human ets-1 RNAs are 
alternatively spliced 4,5 . In chicken, two 
forms of Ets-1 mRNA are generated by 
alternative splicing of 5' exons: a and P 



Table I. The ets gene family 



Amino acid 
homology to 
ETS domain of 
Ets-1 (%) 



Human 

chromosomal 

location 



Expression/features 



human 
mouse 
chicken 



39-52 
63 

54/68 
58/62 



Elevated expression in thymus and endothelial cells; phosphorylated; 
alternatively spliced; positively autoregulates transcription 1 " 6 . 



41/52 



21q22 
llq23 



d; 98% homologous to Fli-1 6,8 . 
Activated by proviral insertion of Friend MuLV; 98% homologous to Erg 11 . 



Elk-1 human 60 

SAP-1 human 58/5 
a/b 



E74A/B Drosophila 110/120 50 



ETS domain located in the amino terminus of the protein; forms ternary 
complex with SRF; shows three regions of homology with SAP-1 6,9,10 . 

SRF accessory protein 1, which, like Elk-1, forms a ternary complex with 
SRF over the c-fos SRE; contains three regions of homology to Elk-1, including 
the ETS domain, which is located in the amino terminus of the protein; the 
two isoforms, SAP-la and SAP-lb, differ in their carboxyl termini 17 . 



expression of E74B, which is 



Elf-1 human 68 50 ND The ETS domain is the human homologue of the E74A protein of Drosophila; 

binds to the NF-AT and NF1L-2B sites in the interleukin-2 promoter and the 
human immunodeficiency virus 2 LTR 16 . 

GABP-a rat 51 82 ND High-level expression in rat thymus; complexes with GABP-p\ which contains 

ankyrin repeats, and is related to the Notch protein 13,14 . 

D-EIg Drosophila 15 64 Drosophila Contains only a DNAWnding domain; maternally expressed message and also 

chromosome expressed throughout embryogenesis 15 . 

3R97D 

PEA3 mouse 68 63 ND Expressed in mouse brain and epididymis and in fibroblast and epithelial cell 

lines; down-regulated in embryonic cell lines in response to retinoic-acid- 
induced differentiation 18 . 

TCFl-a human 55 . ND ND Very limited homology to ETS domain exists within the HMG box of this factor; 

expression is restricted to the thymus and is induced following T-cell 
activation; regulates activity of the TCRa enhancer 19 . 



Abbreviations: ND, not determined; MuLV, murine leukaemia virus; SFFV, spleen focus forming virus; SRF, serum response factor; NF-AT, nuclear factor of activated 
T cells; NFIL-2B, nuclear factor of interleukln 2B; LTR, long terminal repeat; GABP-p, GA-binding protein; PEA3, polyomavirus enhancer activator 3; HGM, high 
mobility group; TCR, T-cell receptor. 
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of p68 are replaced by I 54 in p54 (Fig. 2). 
Unlike p54, p68 is expressed in endo- 
thelial cells rather than in the thymus 
or T cells 4 . The differential functions of 
these proteins are thought to be associ- 
ated with the different /rans-activation 
domains encoded by the 5' exons 25 . 
Exons a and p have not been identified 
in any mammalian species (0. Albagli, 
pers. commun.), but in human ets-1, 
alternative splicing of exons IV and VII 
has been observed 5 . Exon VII encodes a 
domain that has a negative regulatory 
effect on the DNA-binding domain, 
which may be the result of phosphory- 
lation of Ser residues in this region 5 . 
Ets-2 also has this inhibitory domain 
but PU-1 lacks such a domain 46 . 

Phosphorylation of both chicken and 
human Ets-1 and Ets-2 results in the loss 
of non-specific DNA-binding activity 23 . 
The cellular kinases responsible for 
phosphorylating Ets-1 and Ets-2 pro- 
teins are unknown, although putative 
sites of phosphorylation for protein ki- 
nase C, casein kinase II and calmodulin 
kinase have been identified. Following 
activation of T cells, in addition to the 
reduced level of ets-1 transcription, 
there is an increase in phosphorylation 
of Ets-1 that results in reduced DNA- 
binding activity 23 . The half-life of Ets-2 
increases following T-cell activation 
from approximately 20 min to greater 
than 2 h as a consequence of post- 
radiational modification 26 . Protein 
kinase C (PKC) inhibitors block this ef- 
fect, suggesting that PKC directly or in- 
directly modifies the stability of Ets-2 
(Ret. 26). 

PEST sequences (regions rich in Pro, 
Glu, Ser and Thr), may play a role in 
protein turnover and are found in both 
the transcription factor PU-1 (from 
amino acids 121-159) 12 and p68"' J - ; 
(from amino acids 295-326). 'The PEST 
sequences are located between puta- 
tive frans-activation domains and DNA 
binding domains and cleavage may cre- 
ate a DNA-binding domain free from the 
regulatory restraints imposed by 
amino-terminal domains. 

The Ets-binding site 

The p68 Mtt -' protein binds to the 
PEA3 element of the polyoma virus 
enhancer 27 . The DNA sequence 
AGCAGGAAGT is specifically recog- 
nized by p68 MB "' in the polyoma en- 
hancer, and binds p68 c "* ft "' with mod- 
erate affinity as determined by electro- 
phoretic mobility shift assay (EMSA). 
Mutational analysis of the PEA3 element 
has identified several higher-affinity 



Table II. Regula 


itory elements contain 


ng Ets-Wndlng sites 


Regulatory element 


EBS sequence 


Ets protein known to bind 


ets-2 promoter" 


TGGAGGAAGT 


Ets-l/Ets-2 


Interleukin 2 enhancer (NFAT-1) 13 


AAGGAGGAAA 


Elf-1 


polyomavirus enhancer (PEA3) 27 


AGCAGGAAGT 


Ets-l/Ets-2/Erg/Elk-l 


Moloney Sarcoma Virus LTR 32 


GAGCGGAAGC 


Ets-l/Ets-2 


SV40 enhancer 12 


AAGAGGAACT 


PU-1 


ICP4 promoter 13 " 


6X(CGGAAA/G) 8 


GABP-a 

(binding depends on GABP-0) 


HTLV 1 LTR 31 


GGAGGAAAT 


Ets-l/Ets-2 


HTLV 1 LTR 31 


CCGGGAAGC 


Ets-l/Ets-2 


stromelysin 1 promoter 28 


GCAGGAAGC 


Ets-l/Ets-2 


stromelysin 1 promoter 28 


CCAGGAAAT 


Ets-l/Ets-2 


c-fos promoter 10 


CAGGATGT 


SAP-1 

(binding depends on SRF) 


T-cell receptor a enhancer 33 


CAGAGGATGT 


Ets-1 


Abbreviations: EBS, Ets-binding site; HTLV, human T-cell leukaemia virus. 
"X = any nucleotide. 



EBS, such as AGCCGGAAGT 2829 , in 
which the second A has been replaced 
by C. Selection and amplified binding 
studies identified further mutations 
that increase the binding affinity of the 
EBS for p54 c - e *-', such as a G-»A tran- 
sition in addition to the A-»C trans- 
version to generate AACCGGAAGT 30 . 
p54«*', p58 MW , Erg and Elk-1 show 
similar affinities for the same se- 
quences as p68 we -' (Refs 8,9,28) while 
P30"* ' and Elf-1 appear to have different 
sequence specificities 16 ' 28 . 

EBS have been identified in other 
cellular and viral gene regulatory se- 
quences (Table 10 including the long 
terminal repeats (LTR) of the human T- 
cell leukaemia virus 1 (HTLV-I) 31 , the 
Moloney Sarcoma Virus (MSV) LTR 32 
and the rat stromelysin 1 promoter 28 . A 
combination of EMSA and methylation 
interference studies on the binding of 
Ets proteins to the MSV 32 and HTLV-I 31 
LTRs has shown that the GGAA purine 
core is essential for the specific binding 
of Ets-related proteins to these se- 
quences. However, this is not the case 
for sites within certain T-cell-speclfic 
enhancers, such as the enhancers of 
the genes encoding T-cell receptor- 
oCTCRkx) 33 and -B(TCR-p) 17 . In these 
cases, an A-»T transversion in the 
purine core to generate GGAT is fre- 
quently observed (Table H). 

When viral point mutations and the 
carboxyl terminus of the v-ets protein 



were introduced into a 35 kDa version 
of the pSS™"' 1 protein, the altered pro- 
tein lost its ability to bind to the EBS 34 . 
This suggested that the v-ets protein 
has either a different DNA-binding 
specificity to p68 MB -' or does not bind 
DNA at all 34 . Conflicting evidence has 
emerged showing that the v-ets protein 
has a stronger binding affinity for the 
EBS than the full-length protein 
and that cleavage of c-Ets-1 generates a 
truncated protein that binds strongly to 
the EBS 35 . It has been suggested that 
there is a negative regulatory domain in 
the amino terminus of Ets-1 that inter- 
acts with the carboxyl terminus of the 
protein to prevent DNA binding. Cleav- 
age of the protein would repress DNA- 
binding by p68 WB - 7 (Ref. 35). The v-Ets 
protein has a different carboxyl ter- 
minus, so it has been proposed that the 
negative regulatory domain present in 
this viral protein would be redundant, 
since it would be unable to associate 
with carboxy-terminal sequences which 
are present only in the cellular pro- 
tein 35 . It will be interesting to determine 
whether the viral fusion protein active in 
erythroleukaemia, plWvW"*, binds 
to the EBS or not. 

Ets target genes 

The identification of EBS in different 
promoters/enhancers has permitted 
identification of genes that may be regu- 
lated by Ets-related proteins; the ex- 
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1 Sgag | 5myb |g | p | a | b |cl| 



81 120 201 78 249 



119 213 nts 



EM 



(a) Genomic organization of the chicken c-ets-1 locus. Chicken exons (a, (3, I 5 ", a, b, cl, 
c2, d, e and F) are shown as black boxes, along with an alternative nomenclature used for 
the chicken and human exons (I, II, A, III, IV, V, VI, VII, VIII, IX). Two translation start sig- 
nals (ATG) are found in exons a and I 54 , while the stop signal (TGA) is in exon F. (b) The 
135 kDa viral fusion protein of the E26 virus showing the organization of ets sequences 
relative to those of gag and myb. The a and p exons, but not the I 54 exon, have been trans- 
duced. The positions of non-conservative mutations (V) in v ets, compared with c-e£s-l [see 

(c) ] and a change in carboxy-terminal sequences (El) are shown, (c) The organization and 
sizes of the different exons in the alternatively spliced c-ets-1 mRNAs that give rise 
to p54 c " e,sl and p68 cets:I . The number of nucleotides (nts) in each exon is indicated. 

(d) Functional domains of p68 cetsI . The amino acid residues are numbered and aligned 
with the nucleotide sequences of the p68 mRNA in (c) above, thus indicating the exons 
that code for each functional domain of the protein. Rl, region I (trans-activation domain); 
Rll, region II (regulatory domain); Rill, region III (trans-activation domain); DBD, DNA-bind- 
ing domain; HLH, helix-loop-helix domain; PEST, region rich in Pro, Glu, Ser and Thr 
residues which may target protein cleavage to this region of the protein; NTS, nuclear tar- 
geting signal. 



pression of these genes often correlates 
with the expression of Ets-related pro- 
teins. A significant number of lymphoid- 
specific genes contain EBS in their regu- 
latory sequences, including the en- 
hancers of the T-cell receptor-a 33 and 
the T-cell receptor-p (M. Owen, pers. 
commun.). The EBS of the TCR-p 
enhancer mediates both basal and 
phorbol ester/T-cell activator-induced 
expression of the gene encoding TCR-p. 
Expression of Ets-1 represses both 
basal and inducible activity of the 
TCR-P enhancer, while expression of 
Ets-2 has no effect on enhancer- activity. 
Derepression of 7U?-p transcription may 
be mediated by inactivation of Ets-1 
DNA-binding activity, possibly by phos- 
phorylation. 
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A new member of the ets gene family, 
elf-1, has recently been cloned from a 
human T-cell library and shown to bind 
to the EBS in both the interleukin- 2 flL-2) 
enhancer and the human immuno- 
deficiency virus-2 (HrV-2) LTR 16 . The IL-2 
enhancer contains two EBS, previously 
shown to be bound by the NF-AT-1 and 
NF-IL-2B regulatory transcription factor 
complexes 16 . Fos and Jun have recently 
been shown to be present in the NF-AT-1 
complex 45 . Does Elf-1 complex with Fos 
and Jun on the NF-AT-1 site? These 
sites both bind Elf-1 but not Ets-1 or 
Ets-2' 6 . Other EBS have been ident- 
ified in lymphoid-specific promoters/ 
enhancers, such as the IgH enhancer, 
the major histocompatibility complex 
(MHC) II 1-Ap enhancer, the HTLV-I 



LTR, the CD2 enhancer, the CD3 
enhancer, the mb-1 promoter and the 
terminal deoxynucleotide transferase 
promoter. 

Genes involved in the degradation of 
the extracellular matrix (ECM), such 
as those encoding stromelysin- 1, colla- 
genase and urokinase plasminogen acti- 
vator 21 , contain EBS in their regulatory 
flanking sequences. Ets-1 activates ex- 
pression of rat stromelysin 1 (Ref. 28) and 
may thus play a role in coordinating ev- 
ents that lead to degradation of the ECM, 
an important process in metastasis 21 . 

Ets-1 can act either as an activator of 
stromelysin 1 in HeLa cells, or as a 
repressor of TCR-P in quiescent Jurkat 
cells, depending on promoter/enhancer 
context and cell type. Analysis of frans- 
activation of different target genes sug- 
gests a role for Ets-1 in both T-cell 
differentiation and in control of ECM 
degradation. 

Trans-activation by Ets 

7rans-activation by p68"" ' from the 
PEA3 site of the polyomavirus enhancer 
is cooperative with activation by the 
transcription factor, AP-1, bound to the 
adjacent PEA1 site. Cooperativity is not 
mediated at the level of DNA binding 
since altering the DNA binding affinity 
of either the PEA1 or PEA3 sites did not 
alter the ability of these factors to co- 
operate in transcription 27 . Over-ex- 
pression of p^ 15 ■' resulted in inhibition 
of frans-activation by AP-1 of the onco- 
gene response unit, mutated in its PEA3 
site 27 . This suggested that the cooper- 
ativity between AP-1 and peS"*-' is me- 
diated by protein-protein interactions 27 . 
p54««-' and $58*** of chicken also bind 
to the PEA3 site and effect cooperative 
frans-activation with AP-1 26 . Recent evi- 
dence has demonstrated cooperative 
frans-activation by c-Myb and Ets-2 of 
the mim-1 (myfr-mduced myeloid-/) pro- 
moter; however, Ets-1 failed to cooper- 
ate with c-Myb in frans-activation 36 . Co- 
operativity in frans-activation has also 
been observed between two palin- 
dromic EBS in the rat stromelysin 1 
gene promoter 28 . A single EBS does not 
appear to mediate transcriptional acti- 
vation linked to a minimal promoter 37 , 
and, as in the case of p75 c - myb , multiple 
binding sites are required for this acti- 
vation function. 

Domain-swap experiments involving 
fusion of deletion mutants of P68"' 5 " 7 to 
the DNA-binding . domain of bacterial 
LexA have identified three frans-acti- 
vation domains, RI, RII and RIII, in the 
amino-terminal region of peS"'*' (Ref. 



25; Fig. 2). When fused to the LexA DNA- 
binding domain, RI and RII1 can both 
activate transcription from promoters 
containing the lexA operator 25 . RII can- 
not /rans-activate alone, but has a nega- 
tive regulatory effect on the activity of 
RI and a positive effect on the activity 
of R1II. RII, which contains a helix-loop- 
helix motif 3 , is conserved between Ets-1 
and Ets-2, suggesting that RII may me- 
diate common control interactions of 
Ets-1 and Ets-2. Ets-1 and Ets-2 compete 
with LexA-Ets-l/-2 fusion proteins in 
/rans-activation, suggesting that Ets-1 
and Ets-2 proteins interact with a com- 
mon factor. 

The frans-activation domain of PU-1 
has both a Gin-rich region (mapping 
from amino acids 74-93) and an acidic 
region (amino acids 1-165), which may 
act as /rans-activation domains 12 . Re- 
cent work has identified a role for PU-1 
in recruiting other transcription factors 
such as NF-EM5, a B-cell-specific factor, 
to the 3' enhancer of the mouse im- 
munoglobulin kappa light-chain gene, 
where binding to the enhancer by NF- 
EM5 depends upon prior binding of 
PU-1 to the enhancer 12 . The functional 
domains involved in these interactions 
have not been identified. 

The serum response factor (SRF) 
binds to the serum response element 
(SRE) in the promoters/enhancers of 
serum-inducible genes, such as c-fos. 
SRF forms a ternary complex with 
p62 TCF , which is essential for complex 
formation on the SRE. p62 TCF does not 
bind DNA alone. EIk-1 is functionally 
equivalent to p62 TCF , in that it forms a 
ternary complex with SRF 10 . Similar 
results have been obtained with an Elk- 
related protein, SAP-1 17 , which forms a 
ternary complex with SRF over the SRE 
of the c-fos promoter and, like Elk-1, its 
DNA-binding activity depends upon 
interaction with the SRF. Both Elk-1 and 
SAP-1 are members of the Ets family. 
Two domains, including the ETS 
domain of SAP-1, are necessary for 
ternary complex formation 17 . 

A new member of the eft gene family, 
GABP-a, forms the a-subunit of the GA- 
binding protein (GABP), which binds to 
the purine-rich sequence (CGGAAG/A) 
of the ICP4 gene of herpes simplex 
virus 314 . GABP-a mediates DNA binding 
by the multi-subunit protein but GABP-P 
is also necessary for binding 1314 . The 
amino terminus of the p-subunit con- 
tains four 33-amino-acid ankyrin 
repeats found in the Notch, Lin 12, 
cdclO, NF-kB and Ik-B proteins 1314 . 
These 33-amino-acid repeats are necess- 



ary for interaction of GABP-p with 
GABP-a. The ETS domain of the a-sub- 
unit is required for its interaction with 
the p-subunit. 

The transcription factor E4TF1 regu- 
lates transcription of the adenovirus-5 
E4 gene and binds to a sequence 
CGGAAGT, a high-affinity EBS, in the E4 
promoter 38 . E4TF1 is composed of two 
subunits, p53 and p60, and addition of 
purified p53 increases the binding affinity 
of p60 for this site. Neither p53 nor p60 
can frans-activate alone but together 
mediate a tenfold increase in transcrip- 
tion mediated by the CGGAAGT site 38 . 
p60 is a good candidate member of the 
Ets family. 

Ets in transformation 

The v-eft oncogene is co-transduced 
with the v-myb oncogene in the E26 
retrovirus, which expresses a 135 kDa 
fusion protein, ptfS 5 **" 1 ^' 5 (Refs 1,2). 
The acute myeloblastosis virus (AMV), 
which has transduced the v-myb onco- 
gene alone, induces myeloid leu- 
kaemias 39 , whereas E26 induces erythro- 
leukaemias. It was proposed that v-eft 
confers the ability to transform im- 
mature erythroid progenitor cells upon 
the E26 virus. Both v-eft and v-myb 
retain a reduced capacity to transform 
erythroid cells individually, but co- 
expression results in an increased self- 
renewal rate of transformed erythroid 
cells 39 . Clearly, the transforming ability 
of v-eft and v-myb is additive, suggesting 
that v-myb and v-eft oncogene products 
may cooperate in transcription 3639 . This 
transforming activity is superceded by 
the transforming potential of myb-ets 
fusion proteins, which generate erythro- 
leukemia with a much-reduced latency 
and higher frequency 39 . Fusion of the 
v-eft and v-myb oncogenes may gen- 
erate a novel frans-activator with an 
altered target gene specificity or altered 
interactions with other regulatory fac- 
tors. More recent work has shown that 
v-eft can also cooperate with v-erbA in 
transforming erythroid progenitor cells 43 . 

There is a common mechanism that 
results in the activation of two different 
eft genes in virally induced murine ery- 
throleukemias. Proviral integration by 
the Friend Murine Leukaemia Virus 
(Friend MuLV) and the Spleen Focus 
Forming Virus (SFFV) into flanking 
sequences activates expression of the 
fli-1 and spi-1 genes respectively. 

The human eft-7 and ets-2 genes are 
translocated in certain types of acute 
leukaemia from chromosomes 11 to 4 
and from 21 to 8, respectively, although 



neither gene spans the chromosome 
breakpoint 2 . Similarly, the elk-1 gene 
maps close to chromosome breakpoints 
in synovial sarcomas 2 . The significance 
of these translocations for Ets function 
is unclear. 

The role of eft gene family members 
as oncogenes has been tested in classi- 
cal transformation experiments in 
which over-expression of the human 
c-eft-7 or c-eft-2 oncogenes in NIH-3T3 
mouse fibroblasts generates trans- 
formed foci in low serum conditions 3 ' 41 . 
These transformed cells grew in soft 
agar and formed tumours in nude mice. 
The transforming capacity of the c-eft-7 
proto-oncogene may be associated with 
its ability to positively autoregulate its 
own expression 3,41 . 

Increased binding to the PEA3 el- 
ement of the polyoma enhancer occurs 
when fibroblasts are stimulated with 
serum or phorbol esters or transformed 
by v-src, v-raf, polyoma middle T anti- 
gen, v-mos or c-Ha-ras (Ref. 36). This 
induction of binding activity is indepen- 
dent of protein synthesis inhibitors, 
suggesting that it is mediated by post- 
translational modifications, such as 
phosphorylation or interactions with 
cofactors. Recent evidence suggests 
that Raf-1 kinase is directly/indirectly 
involved in activating Ets-driven pro- 
moters following Ras transformation of 
fibroblasts 42 . 

An interesting new aspect concerning 
the role of eft in growth control has 
emerged from studies of Ets-1 expression 
in human endothelial cells 21 . Ets-1 ex- 
pression correlates with the prolifer- 
ation of endothelial cells in normal 
blood vessel formation, associated with 
wound healing and normal develop- 
mental processes, but also with vascu- 
larization of tumours during malignant 
progression 21 . This evidence, combined 
with the ability of Ets-1 to regulate the 
expression of ECM-degrading proteins 
such as stromelysin and collagenase, 
suggests an important role for Ets-1 in 
the metastatic processes associated 
with late tumour development 2128 . 

Perspectives 

The number of members in the eft 
gene family is growing rapidly, the 
encoded proteins of which function as 
regulators of cellular and viral gene 
transcription. Some Ets proteins, such 
as Ets-1 and Ets-2, interact with the 
same DNA-recognition sites and may 
function antagonistically, either acti- 
vating or repressing transcription of the 
same target gene, depending on pro- 
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moter context, cell-type or stage of dif- 
ferentiation. Other Ets proteins, such as 
Elf-1, interact with different, although 
similar sites, and probably regulate dif- 
ferent target genes. How the expression 
and activity of different Ets proteins are 
coordinated with respect to cell-type, in 
differentiation, during the cell cycle and 
in development, is unclear. The capacity 
of certain Ets proteins to interact with 
regulatory cofactors and to be regulated 
by phosphorylation suggests that Ets 
proteins may play a role in coordinating 
changes in gene expression in response 
to second messengers and extracellular 
signals. Ets proteins may be modulated 
differently by interactions with different 
cofactors. The possible role of these 
interactions in the regulatory pathways 
of cell-cycle control and differentiation 
needs further evaluation. 
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Spl is one* of the very first cellular transcription factors to be identified and cloned in virtue of 
its binding to a G-rich motif in the SV40 early promoter. Spl protein binds to the G-rich 
sequences present in a variety of cellular and viral promoters and stimulates their transcrip- 
tional activity. Recently, a number of other GC and/or GT box-binding factors homologous to 
Spl have been isolated, namely Sp2, Sp3 and Sp4, and the two more distantly related factors, 
BTEB and BTEB2. The discovery of this family highlights a previously unknown level of com- 
plexity of transcriptional regulation of promoters containing GC and/or GT box motifs. This 
review focuses primarily on strategies aimed to elucidate the transcription properties of the 
Spl-like factors and discusses the experimental problems inherent in the attempt to define 
their respective functions. © 1997 Elsevier Science Ltd. All rights reserved 
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INTRODUCTION 

Our knowledge of the transcriptional regulat- 
ory mechanisms governing gene expression is 
accumulating at an accelerating pace. Such 
progress stems largely from the combination 
of biochemical and genetics studies of gene 
transcription. Initiation of messenger RNA 
synthesis is the major site for regulation of 
eukaryotic gene expression. Intense effort has 
been focused on identifying transcription fac- 
tors that control mRNA synthesis by RNA 
polymerase II. These studies recently culmi- 
nated in the purification and molecular clon- 
ing of a set of essential general initiation 
factors (GTFs) that promote basal transcrip- 
tion by polymerase II. In addition, a large 
number of sequence-specific DNA-binding 
transcription activator or repressor proteins 
have been isolated. It is now widely appreci- 
ated that transcription initiation can be 
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broadly divided into three steps: initiation 
complex assembly, isomerization, and promo- 
ter clearance. In the first step RNA polymer- 
ase and associated factors bind reversibly to 
the promoter. In the second step a stretch of 
promoter DNA becomes unwound and serves 
as a template for transcription. Finally, the 
polymerase leaves the promoter to elongate 
the transcript (Conaway and Conaway, 1994). 
The efficiency of each step can be subject to 
regulation by transcription activator or repres- 
sor proteins. DNA-bound transcription factors 
are thought to influence the transcription 
through protein-protein interaction with com- 
ponents of the general initiation factors result- 
ing in an enhanced recruitment or stabilization 
of the basal promoter complex to the tem- 
plate. Upon binding their cognate sites, activa- 
tor proteins stimulate transcription via an 
activation domain that is functionally distinct, 
and usually physically separate, from the 
DNA-binding domain (Latchman, 1991). A 
subset of activators are expressed in a cell 
type-specific manner, e.g. Oct-2 in B cells, and 
regulate the expression of genes specific for 
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the given cell type. Other activators, such as 
Spl, are ubiquitously expressed and are 
required for the constitutive and inducible ex- 
pression of a variety of genes. The human 
transcription factor Spl binds to the so-called 
GC boxes, and has long been thought to be 
the single factor acting through these sites 
(Kadonaga et al, 1987). However, this view 
has changed following the isolation of cDNAs 
encoding Spl -related proteins, indicating the 
presence of a novel Spl multigene family. To 
date three Spl -related proteins, which have 
been called Sp2, Sp3 and Sp4 have been 
described (Hagen et al, 1992; Kingsley and 
Winoto, 1992). They contain zinc fingers as 
well as glutamine and serine/threonine rich 
amino acid stretches similar to those of Spl. 
The DNA binding domains of the Sp2, Sp3 
and Sp4 proteins are highly conserved and 
they recognize GC boxes with specificity and 
affinity similar to that of Spl. Two more dis- 
tantly related factors, BTEB and BTEB2, have 
been isolated by virtue of their ability to bind 
a GC-box (Imataka et al, 1992; Sogawa et al, 
1993). Clearly, the existence of proteins with 
DNA-binding specificity similar to Spl indi- 
cates that gene regulation by Spl is more com- 
plicated than previously assumed. 

ISOLATION OF THE SP1-FAMILY MEMBERS 

The human transcription factor Spl was 
originally identified as a factor able to bind 
and activate the SV40 early promoter. The 
Spl protein binds selectively to a GC-rich 
decanucleotide sequence known as the "GC 
box", which is present as six tandem copies in 
the SV40 early promoter (Kadonaga et al, 
1987). Subsequent studies revealed that a var- 
iety of cellular and viral promoters contain 
GC boxes and can be activated by Spl in 
vitro; these GC boxes are often found near 
binding sites for other transcription factors, 
suggesting that these factors may act in con- 
junction with each other to modulate tran- 
scription. Cloning of the Spl cDNA from 
HeLa cell RNA showed that it contained 
three putative zinc fingers of the Cys 2 His 2 type 
(Kadonaga et al., 1987). Functional analysis 
have delineated portions of Spl required for 
activation, DNA-binding and multimerization, 
respectively (Courey and Tjian, 1988; Courey 
et al., 1989; Pascal and Tjian, 1991). The Spl 
trans-activation domain has been subdivided 
into two subdomains (A and B) with regions 



rich in serine/threonine and glutamine resi- 
dues, and a region close to the zinc fingers lar- 
gely composed of charged amino acids, which 
synergistically stimulate transcription. The Spl 
protein has also been shown to be modified 
with O-linked oligosaccharides in vivo and 
phosphorylated by a DNA-dependent protein 
kinase (Jackson and Tijan, 1988; Jackson et 
al., 1990). Thus Spl may be a target of intra- 
cellular signalling, although it is not yet clear 
whether these post-translation modifications 
play any role in regulating or modifying the 
Spl-mediated transcription. Moreover, it is 
uncertain whether Spl-mediated transcription 
is actively regulated in mammalian cells or is 
simply determined by the amount of Spl pro- 
tein within the nucleus. The mouse Spl gene 
was shown to be ubiquitously expressed but 
different cell types and different stages of 
development show up to 100-fold differences 
in the levels of Spl mRNA (Saffer et al., 1990, 
1991). Based on these considerations, it is 
likely that Spl acts in combinatorial manner 
with other transcription factors, which may 
have more pronounced temporally or spatially 
restricted expression patterns. 

Several proteins with homology to Spl have 
been identified in a number of different sys- 
tems (Fig. 1). A rat protein designed BTEB 
was cloned by virtue of binding the BTE box, 
a GC box sequence in the promoter of the rat 
P4501A1 gene (Imataka et al., 1992). Both 
Spl and BTEB affect expression of P4501A1 
through the BTE sequence. Using the rat 
BTEB cDNA as a probe, a related cDNA 
designated BTEB2 was isolated from human 
placenta library (Sogawa et al, 1993). This 
gene was found to be expressed specifically in 
testes and placenta, but no known function 
has yet been described. The two Spl -related 
proteins called Sp2 and Sp3 were both ident- 
ified for their binding to a GT box motif in 
the T-cell antigen receptor (TCR) a promoter 
(Kingsley and Winoto, 1992). These genes 
were cloned based on the sequence similarity 
of their zinc fingers to those of Spl. Sp2 pro- 
tein was found to bind the GT box motif 
weakly, whereas Sp3 bound this sequences 
with high affinity comparable to that of Spl. 
Two human transcription factors, designated 
originally SPR-1 and SPR-2 (for "Spl related 
proteins"), were isolated because of their abil- 
ity to bind a GT box motif in the rabbit utero- 
globin promoter (Hagen et al, 1992). In an 
attempt to identify transcription factors re- 
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Sp1 N-U 

Sp2 



Sp3 
Sp4 
BTEB1 



BTEB2 



mm S/T-rich 

KSS Q-rich 

am +/- E/K/R-rich 

Zn finger 

Zn finger 
QBESB Zn finger 

Fig. 1. Schematic representation of the functional domains of Spl and comparison with those present 
in the related Spl-like proteins. 



sponsible for the relatively high expression of 
uteroglobin in particular cell types, the human 
endometrial cells (Ishikawa cell line) and the 
lung cell line NCI-H441, an expression library 
made from Ishikawa cell mRNA was screened 
with a concatemer of the GT box binding site 
and SPR-1 and a related SPR-2 cDNA were 
cloned (Hagen et al., 1992). SPR-2 is identical 
to Sp3, and SPR-1 has been renamed Sp4. 
The zinc finger regions of these proteins and 
of Spl bind to both GT and GC box motifs 
with similar affinities. The discovery of a 
family of transcription factors in vertebrates 
with homologous zinc finger amino acid 
sequence (Fig. 2) and similar DNA binding 
properties raises the intriguing possibility that 
genes once thought to be regulated by Spl 
might be controlled in vivo by protein(s) other 
than Spl. Another homologue of Spl is the 
buttonhead {btd) gene of Drosophila, which is 



expressed in the head anlagen of early 
Drosophila embryos (Wimmer et al., 1993). 
The btd gene plays an important role during 
Drosophila development, with btd mutant flies 
having abnormalities in head morphogenesis. 
A transgene expressing the human Spl gene 
was able to partially rescue the phenotype of 
btd mutant flies (Wimmer et al., 1993), 
demonstrating at least some degree of func- 
tional conservation between the Drosophila 
and vertebrate proteins. 



Inspection of the Spl-like cDNAs isolated 
in different laboratories has revealed the pre- 
sence of a highly conserved DNA binding 
domain located near the C-terminus of the 
proteins. As represented in Fig. 1, each protein 
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Spl GKKKQHICHIQGCGKVYGKTSHLRAHLRWHTGER 

Sp2 Q V PD TFR L V-L 

Sp3 P s finger 1 

Sp4 E 

BTEB ASE-R-K-PYS S K— Y-V 

BTEB2 E-RRI-Y-DYP — T T-S K T K 



Spl PFMCTWSYCGKRFTRSDELQRHKRTHTGEK 

Sp2 --V-N-FF A D- 

Sp3 --V-N-M R finger 2 

Sp4 --I-N-MF R 

BTEB --P PD-L S T— Y 

BTEB2 -YK- — EG-DW— A T--Y-K A- 



Spl KF AC PEC PKRFMRSDHL S KH I KTHQNKK 

Sp2 R-E-AQ-Q T--Y---LVT- 

Sp3 --V S A finger 3 

Sp4 R-V S V 

BTEB Q-R--L-E T--ARR-TDFH 

BTEB2 P-Q-GV-NRS-S AL-M-R--- 



Fig. 2. Comparison of the amino acid sequences of the zinc finger region of Spl and related proteins as 
indicated. Dashes represent identical residues. 



contains three zinc fingers with structure of 
Cys 2 -His 2 . Homology to Spl at the zinc finger 
domain varies from 59% for BTEB2 to 90% 
for Sp3 and Sp4. Analysis of the DNA-bind- 
ing activities revealed that Sp3, Sp4, BTEB 
and BTEB2 proteins recognize GC and/or 
GT-box motifs with specificity and affinity 
very similar to that of Spl (Hagen et al., 1992; 
Imataka et al., 1992; Kingsley and Winoto, 
1992; Sogawa et al., 1993; Hagen et al., 1994). 
Contrary to these factors, Sp2 which binds to 
the GT box in the promoter of a T-cell recep- 
tor gene, seems to have divergent nucleotide 
recognition properties (Kingsley and Winoto, 
1992). The predicted Sp3 and Sp4 protein 
sequences have extensive homology to Spl 
throughout the entire open reading frame, 
whereas homology outside the zinc finger 
domain of Sp2 is much more limited. 

Both BTEB and BTEB2 do not show any 
homology to Spl outside the DNA-binding 
domain. BTEB has an acidic region and sev- 
eral transcription factors such as GAL4 are 
equipped with an acidic domain as effector 
(Mitchell and Tjian, 1989). BTEB has been 
tested for its ability to regulate gene transcrip- 
tion and both Spl and BTEB activated the ex- 
pression of promoters with repeated GC box 
sequences such as the SV40 early promoters 
and the HIV-1 LTR promoters (Imataka et 
al., 1993). Interestingly, BTEB represses the 
activity of the promoter containing a single 



GC box that instead can be stimulated by Spl 
(Imataka et al, 1992). However, when the 
GC-box was repeated five times, BTEB turned 
out to be an activator. Therefore, it appears 
that BTEB could exert a different effect on the 
transcription of those genes with a different 
number and arrangement of GC-box 
sequences in the promoter region. Although 
BTEB mRNA is expressed in various mamma- 
lian tissues and cell lines, the presence of the 
BTEB protein is confined to brain and some 
cell lines, such as Neuro2A cells. It has been 
shown that BTEB mRNA is differentially 
translated depending upon the cellular context 
(Imataka et al., 1994). Finally, the human 
BTEB cDNA has been cloned and mapped to 
the chromosome 9ql3 (Ohe et al., 1993). 
BTEB2 is characterized by the presence of a 
Proline-rich domain, which is known to be 
one of the motifs responsible for transactiva- 
tion (Latchman, 1991). Accordingly, the 
BTEB2 proline-rich region fused to GAL4 
DNA-binding domain could enhance tran- 
scription of a reporter bearing the GAL4 
DNA-binding sites (Sogawa et al., 1993). 

Clearly, Spl, Sp3 and Sp4 are the Sp-mem- 
bers with the most close homology. All three 
proteins contain highly conserved DNA bind- 
ing domains consisting of three zinc finger 
motifs at the carboxyl terminus. Sp3 and Sp4 
also contain several putative functional 
domains homologous to those of Spl. These 
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include: (i) the amino-terminal glutamine- and 
serine/threonine-rich regions similar to the A 
and B transactivation domains of Spl; (ii) the 
highly charged domain immediately 5' of the 
zinc fingers; (iii) a carboxy-terminal domain 
similar to the D transactivation domain of 
Spl present in Sp3. However, the expression 
pattern of the three proteins is quite different. 
Both Spl and Sp3 proteins are ubiquitously 
expressed at a high level in many mammalian 
cell lines (Fig. 3), whereas Sp4 expression 
appears to be restricted to certain cell types of 
the brain. Finally, the chromosomal localiz- 
ation of the three genes has been reported. 
Spl has been mapped on chromosome 12ql3 

Ab-Sp1 + + 

Ab-Sp3 + + 




Fig. 3. Gel mobility shift analysis using as probe a labelled 
oligonucleotide spanning the three Spl -binding sequences 
present in the promoter of the HIV-1 LTR, and a nuclear 
extract from HeLa cells. The presence of specific anti- 
bodies in the binding reaction is indicated above each line. 
Specific complexes for Spl and Sp3 are indicated on the 
left. 



(Gaynor et al., 1993; Matera and Ward, 1993), 
Sp3 is located at 2q31 (Kalff-Suske et al., 
1996) and Sp4 on chromosome 7pl5 (Kalff- 
Suske et al., 1995). 

The high degree of structural conservation 
between Spl, Sp3 and Sp4 suggested that Sp3 
and Sp4 may regulate gene transcription in a 
manner similar to that of Spl. In the following 
sections we will describe the functional proper- 
ties of these factors in direct comparison with 
those of Spl. 

Spl IS REQUIRED FOR EARLY DEVELOPMENT 

A large number of observations imply a 
wider role of the Spl protein in the transcrip- 
tional regulation of tissue-specific and ubiqui- 
tous genes. Moreover, Spl has been 
implicated in the control of cell cycle-regulated 
genes such as thymidine kinase, B-myb and 
dihydrofolate reductase (Karlseder et al., 1996, 
Lin et al., 1996, Zwicker et al., 1996). It has 
been suggested that the activity of Spl may be 
regulated by members of the retinoblastoma 
family proteins pRB and pi 07 (Kim et al., 
1992; Udvadia et a!., 1993, 1995; Chen et al., 
1994; Datta et al., 1995) and by Gl-specific 
cyclins (Shao and Robbins, 1995). In addition, 
Spl -binding sites appear to play a critical role 
for the maintenance of the methylation-free 
CpG island (Brandeis et al., 1994, Macleod et 
al., 1994). Thus, it appears that Spl plays an 
important role in controlling putative cell- 
cycle regulated genes and it is required to pre- 
vent methylation of CpG islands. Despite a 
great deal of information on the molecular 
properties of Spl, little is known about its bio- 
logical function in vivo. Very recently Marin 
and collaborators (Marin et al., 1997) reported 
the inactivation of the mouse Spl gene and 
the characterization of the phenotype of the 
Spl-null mice. They found that the Spl -/- 
embryos are retarded in development, survive 
until day 9.5 of gestation and show a broad 
range of phenotypic abnormalities. The broad 
range of abnormalities suggests that Spl -de- 
ficiency causes a general cellular defect that 
precludes normal development and survival. 
Due to the large number of putative Spl tar- 
get genes it was not surprising that inacti- 
vation of Spl is not compatible with cellular 
growth and differentiation. However, Marin et 
al. (1997) also looked at the expression of 
putative Spl genes at the "housekeeping" and 
cell cycle-regulated TK, DHFR, AFRT and 
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HPRT, and some tissue-restricted as ApoAl 
and LDLR genes no differences were found. 
Moreover, they also analyzed the CpG islands 
and found that the islands remain methylation 
free. The only clear differences in the Spl 
embryos was found at the level of MeCP2 ex- 
pression. This protein binds the methylated 
DNA and it appears to be required for the 
maintenance of the differentiated cells (Tate et 
al, 1996). Marin et al (1997) suggested that 
Spl is dispensable for growth and differen- 
tiation of primitive cells and suggest that Sp 1 
is likely to play a pivotal role in the mainten- 
ance of differentiated cells likely through tran- 
scriptional regulation of genes such as 
MeCP2. It is safe to assume that other mem- 
bers of the Spl family, like Sp3 and or Sp4, 
may compensate at least in part for the loss of 
Spl activity in the transcriptional regulation 
of cell-cycle regulated and tissue restricted 
genes. Despite this potential compensatory 
function, the data reported by Marin et al. 
clearly demonstrated that Spl -dependent gene 
activation is strictly required for mouse devel- 
opment, 



Sp4 IS A TISSUE-SPECIFIC TRANSCRIPTIONAL 
ACTIVATOR 

The first indication that Sp4 functions as an 
activator much as Spl, came from transfection 
experiments into mammalian cell lines. 
According to the notion whereby Sp4 is a 
transcriptional activator, it was found that 
GAL4 fusion proteins, in which the two gluta- 
mine-rich domains were fused to the GAL4 
DNA-binding domain, exert the potential for 
transactivation of appropriate reporters, like 
those present in Spl (Majello et al., 1994; 
Dennig et al., 1995; Hagen et al., 1995). A 
detailed functional analysis of the transcrip- 
tional properties of Sp4 has been recently 
reported (Hagen et al., 1995). These studies 
demonstrated that Sp4 is an activator protein 
like Spl. It has been shown that Sp4 is able to 
trans-activate viral and synthetic promoters 
bearing GC or GT box motifs both in mam- 
malian cells and in Drosophila SL2 cells which 
are devoid of endogenous Sp-like activity. 
Moreover, transfection experiments in 
Drosophila SL2 cells revealed that in contrast 
to Spl, Sp4 is not able to act synergistically 
through adjacent binding sites. The region of 
Spl protein required for synergistic activation 
has been mapped (Pascal and Tjian, 1991). 



Sequence comparison (Fig. 1) indicates that 
Sp4 lacks the D-domain present in Spl. It has 
been shown that the D-domain is indispensa- 
ble for Spl -mediated synergistic transactiva- 
tion. The absence of the D-domain in Sp4 
may thus account for the lack of Sp4-mediated 
synergistic activation. Interestingly, Sp4- 
mediated activation is strongly enhanced in 
the presence of Spl mutant lacking the DNA 
binding domain. This finding could imply that 
the glutamine-rich domains of Sp4 and those 
of Spl are functionally related to each other 
(Hagen et al., 1995). 

Northern analysis clearly indicates that Sp4 
expression in vivo is restricted to certain cell 
types of the brain (Hagen et al., 1992). Thus, 
Sp4 may play an important role in governing 
expression of certain genes in these cells. This 
hypothesis has recently received experimental 
support by the study of the mouse sp4 gene 
during development (Supp et al, 1996). This 
study reported the isolation and mutation by 
gene targeting of the mouse Sp4 gene. The 
Sp4 gene was cloned from mouse because it is 
located within a region of chromosome 12 
deleted in the mutant mouse legless (lgl). Sp4 
gene has an interesting pattern of expression 
during embryonic development. In situ hybrid- 
izations showed that Sp4 is highly expressed in 
developing mouse brain. In early embryos, it 
is expressed in the posterior neuropore; layer, 
high expression levels were seen throughout 
the entire CNS, including the hypothalamic 
region and in the nasal epithelium and vomer- 
onasal organs. Lower levels of expression have 
been detected in several mesenchymal tissues 
such as the liver, testes and teeth. In adult 
mice, the gene is expressed more widely, with 
the highest levels seen in the brain. Analysis of 
Sp4-null mice indicated that this gene plays a 
role for normal growth and male fertility. 
Two-thirds of Sp4 homozygous mutants die 
within the first few days after birth, and those 
that survive are smaller than their wild-type 
litter-mates. The cause of the early death 
remains undetermined. The only other detect- 
able defect of these mutants is the lack of sex- 
ual activity in Sp4-null males, although 
structural abnormalities of the male reproduc- 
tive tract were not detected. The testes, epidi- 
dymis, vas deferens and prostates of the 
mutants were indistinguishable from wild type. 
Moreover, histological sections of the mutant 
testes revealed the presence of mature sperm. 
Therefore, a possible interpretation of these 
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findings could be that the mutation has caused 
a behavioural defect that affects male repro- 
duction. 

Sp3 IS A BIFUNCTIONAL TRANSCRIPTION 
REGULATOR 

Both Spl and Sp3 proteins are ubiquitously 
expressed in many mammalian cell lines, and 
as reported in Fig. 3, both proteins are 
expressed at comparable levels. Transfection 
experiments into mammalian cell lines indi- 
cated that Sp3 was not able to activate various 
Spl -responsive promoters (Hagen et al, 1994; 
Majello et al, 1994, 1995; Dennig et al, 
1995). However, a severe limitation of these 
experiments is due to the presence, in all the 
mammalian cell lines tested, of endogenous 
Spl and Sp3 proteins which could obscure 
and/or complicate the interpretation of these 
transfections. The Drosophila SL2 cell line 
lacks endogenous Sp factors, therefore it has 
been instrumental in analyzing the Sp3- 
mediated transcription. It has been originally 
found that the Sp3 failed to activate Spl -re- 
sponsive promoters such as HIV-1 and 
HTLV-1 in Drosophila SL2 cells (Hagen et al., 
1994; Majello et al, 1994). Moreover, cotrans- 
fection experiments in Drosophila SL2 cells 
indicated that enforced expression of Sp3 
represses Spl -mediated activation of several 
viral and cellular promoters. A likely interpret- 
ation of these results is that Sp3 lacks transac- 
tivation potential and it can suppress the Spl 
activity simply because it binds to, and thus 
competes for, the same DNA binding sites. 
Therefore, Sp3-mediated transcriptional 
repression can result from competition for the 
same DNA-binding sites or steric hindrance 
between repressors and positively acting tran- 
scription factors, such as Spl. Because this 
type of repression results from displacement of 
transcription factors from the DNA, Sp3 
might not necessarily possess an active repres- 
sion function. Alternatively, Sp3 may actively 
repress Spl -dependent transcription due to the 
presence of a putative repressor domain. Such 
hypothesis is supported by the finding that a 
fusion protein consisting of the yeast GAL4 
DNA binding domain and the N-terminal part 
of Sp3 containing the glutamine-rich domains 
did not activate GAL4-responsive promoters 
in mammalian cell lines. Moreover, it was 
found that Tat-mediated activation of several 
HIV-1 LTR reporters, containing five Gal4 



DNA-binding sites inserted at different dis- 
tances from the transcription start site, was 
abrogated by co-expression of the chimeric 
GAL4-Sp3 protein (Majello et al, 1994). 
Distance-independent repression observed with 
the GAL4-SP3 fusion protein rules out the 
possibility that Sp3 is simply sterically block- 
ing the promoter. Repression at distance sup- 
ports a mechanism requiring protein-protein 
interactions between distantly bound Sp3 pro- 
tein and proximal factors, such as TAFs, loop- 
ing out intervening DNA (Licht et al., 1993, 
Hanna-Rose and Hansen, 1996). This con- 
clusion has received further support by a 
study describing in vivo transfections, in which 
the non-finger region of Sp3 and a defined 
activating domain connected to different heter- 
ologous DNA-binding domains were both tar- 
geted to a promoter (De Luca et al, 1996). 
These results suggest that Sp3 may act as a 
transcriptional repressor of RNA polymerase 
II promoters by protein-protein interaction 
with components of the general transcription 
complex (De Luca et al, 1996). 

Based on the above mentioned results, it 
appears that Sp3 is a DNA-binding dependent 
repressor. However, in a recent report Sp3 has 
been described as a weak activator of several 
cellular promoters in Drosophila SL2 cells and 
stimulate transcription by functional inter- 
action with the retinoblastoma protein 
(Udvadia et al, 1995). Moreover, using 
Drosophila SL2 cells, Birnbaum et al (1995) 
examined the ability of Sp3 to repress Spl -dri- 
ven transcription from several eukaryotic pro- 
moters that contain GC boxes. These 
investigators found that Sp3 repressed Spl ac- 
tivation with the dihydrofolate reductase pro- 
moter, which contains multiple functional GC- 
boxes. In contrast, Sp3 was not capable of 
affecting Spl activation of the histone H4 or 
the thymidine kinase promoters, both cell 
cycle-controlled promoters containing single 
GC-boxes. These studies suggest that Sp3 
abrogation of Spl -driven transcription is pro- 
moter-specific. 

Two very recent independent reports 
demonstrated that Sp3 has the potential to 
activate transcription in both mammalian and 
Drosophila cells (Dennig et al, 1996; Majello 
et al, 1997). Both studies have used gene 
fusion experiments to dissect the functional 
domains of Sp3. It has been demonstrated 
that Sp3 is a bifunctional protein containing 
independent modular repressor and activator 
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domains. The activation potential of Sp3 is 
distributed over two glutamine-rich N-terminal 
regions. Both glutamine-rich domains of Sp3 
can stimulate transcription as efficiently as the 
corresponding Spl glutamate domains. Thus, 
both proteins have an N-terminus glutamine- 
rich region that functions as a transferable ac- 
tivation domain. The Sp3 transcriptional 
repressor activity has been mapped in a small 
amino acid region located at the 5' of the zinc 
finger DNA-binding domain, and this domain 
appears to be sufficient to confer repressor 
function when fused to a heterologous DNA- 
binding domain (Majello et al, 1997). Finally, 
it has been shown that all chimeric GAL4 pro- 
teins containing the Sp3 inhibitory/repressor 
domain function as transcriptional repressors 
when bound to a promoter bearing multiple 
DNA-binding sites. These results suggest that 
the repressor domain is able to mask the glu- 
tamine-rich activating domain. Accordingly, 
GAL4 fusions lacking the repressor/inhibitory 
domain function as transcriptional activators. 
The major finding reported in these studies 
relay in the demonstration that Sp3 protein 
has the potential to activate transcription due 
to the presence of the glutamine-rich domains 
located at the N-terminus (Dennig et al., 1996; 
Majello et al., 1997). These domains are very 
similar to the glutamine-rich activating 
domains present in Spl. In vivo and in vitro 
cooperation between Spl and TFIID has been 
well documented. It has been demonstrated 
that the glutamine-rich activation domain of 
Spl can interact with the TBP-associated fac- 
tor dTAFHO and TBP itself (Emili et al., 
1994; Gill et al., 1994). Consistent with the 
structural similarity of the Spl and Sp3 acti- 
vating domains, a functional cooperation 
between the Sp3 protein and components of 
the basal transcription complex such as 
TAF110 and TBP has been reported (Dennig 
et al., 1996; Majello et al, 1997). 

In summary, from these studies it appears 
that Sp3 is a dual-function regulator and the 
context of cognate DNA-binding sites in a 
promoter appears to be one of the elements 
that determines the strength of the Sp3- 
mediated repression. 

Whatever the molecular mechanism respon- 
sible for the function of Sp3 may be, it is also 
clear that its repression ability is dependant 
upon the cellular context. Sjottem et al. (1996) 
reported that Sp3 had no stimulatory effect on 
transcription from the LTR of the HERV-H 



family human endogenous retrovirus-like el- 
ements in HeLa cells. However, in the embry- 
onal carcinoma cell line NTera2-Dl, Sp3 
stimulated the transcriptional activity of the 
LTR promoters Transcriptional down-regu- 
lation of a2-integrin and E-cadherin occurs in 
human mammary epithelial cells overexpres- 
sing Erb-B2 oncogene. It has been shown that 
Sp3 DNA-binding activity to the a2-integrin 
promoter is not altered in Erb-B2-overexpres- 
sing cells, whereas the Spl DNA-binding ac- 
tivity is reduced. Thus, it is plausible that Sp3 
is directly responsible for the downregulation 
of a2-integrin gene expression in Erb-B2 over- 
expressing cells (Ye et al., 1996). An additional 
example of the cell-type specific role of Sp3 
has been observed upon in vitro induced differ- 
entiation of primary human keratinocytes. In 
primary keratinocytes, Sp3 protein levels 
exceed those of Spl and this ratio became 
inverted after in vitro differentiation (Apt et 
al., 1996). Prowse et al. (1997) reported inci- 
dences that Sp3 is likely involved in the tran- 
scriptional induction of the cyclin-dependent 
kinase inhibitor p21 during terminal differen- 
tiation of keratinocytes. Moreover, a role for 
Sp3 acting as a repressor has been proposed 
for the elastin gene transcription. The insulin- 
like growth factor-I (IGF-I) increases elastin 
gene transcription in aortic smooth muscle 
cells and this up-regulation is accompanied by 
the loss of protein binding to the promoter 
(Conn et al., 1996), and it has been suggested 
that IGF-I-mediated increase in elastin tran- 
scription occurs via a mechanism of derepres- 
sion involving the abrogation of Sp3 binding 
to the GC-box present in the proximal promo- 
ter. Finally, Grekova et al. (1996) found that 
Sp3 gene transcription is suppressed in periph- 
eral blood mononuclear cells from most 
Multiple Sclerosis (MS) patients. However, at 
this stage it is uncertain whether the lack of 
Sp3 expression is directly involved in the MS 
phenotype or if it simply reflects genetic het- 
erogeneity. 

CONCLUSION AND FUTURE PERSPECTIVE 

Despite a great deal of recent progress in 
the understanding of the transcriptional con- 
trol of gene expression by different members 
of the Sp-like factors (Table 1), important 
questions remain unanswered. A key point is 
the elucidation of the role of specific Sp mem- 
bers in the control of gene transcription of a 
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Table 1. Sp factors 



Factor 


mRNA 


Protein 


Expression 


Mapping 


Function 


Association 


"Null" mice 


Spl 


8.2 kb 


90 kDa 








TBP; 


Lethal; required for early 






Ubiquitous 


12ql3 


Activator 


TAF110 


development 


Sp3 


5kb 


97kDa 


Ubiquitous 


2q31 


Repressor 


TAF110 


N.D. 




60 kDa 






Activator 


TBP; 




















Sp4 


7kb 


82 kDa 


restricted 


7pl5 


Activator 


N.D. 


Required for male fertility 



N.D. not determined 



given promoter. The presence of a set of fac- 
tors that interact with common binding sites 
raises the cogent question of how a promoter 
is regulated by different transcription factors 
with a similar DNA binding specificity. The 
molecular anatomy of several promoters 
reveals the presence of G-rich motifs often 
found adjacent to binding sites for other tran- 
scription factors, suggesting that these factors 
may act in conjunction with each other to 
modulate transcription. A physical and/or 
functional interaction between Spl and other 
transcription factors has been documented. A 
cooperative interaction between Spl and 
NF- k -B is required for HIV-1 enhancer in 
JurkatT cells (Perkins et ai, 1993). Such 
cooperative interaction has not been observed 
with the Sp4 protein, although both Spl and 
Sp4 proteins activate basal and Tat-mediated 
expression of HIV-1 promoter (Majello et ai, 
1994). p53 and Spl appear to interact and co- 
operate in Tumor Necrosis Factor-induced 
transcriptional activation of HIV-1 promoter 
(Gualberto and Baldwin, 1995). Spl and E2F- 
1 factors appear to functionally and physically 
interact and regulate transcription of the 
hamster dihydrofolate reductase and the 
mouse thymidine kinase promoters (Karlseder 
et ai, 1996; Lin et ai, 1996). It will be of 
great interest to determine whether such func- 
tional cooperations are specific for Spl, since 
they could also provide a functional assay to 
distinguish between the various Sp-related fac- 
tors. 

A further level of complexity in the regu- 
lation of genes containing Sp-binding sites in 
their regulatory region has been heightened by 
experiments showing that Spl -dependent tran- 
scription is influenced by specific cell-cycle reg- 
ulator proteins. It has been reported that RB 
and pi 07, two members of the retinoblastoma 
protein family, activate or repress Spl-depen- 
dent transcription in a cell-type-dependent 
manner (Kim et ai, 1992; Udvadia et al., 



1993, 1995; Chen et ai, 1994; Datta et ai, 
1995). Furthermore, it has been suggested that 
RB indirectly stimulates Spl transactivation 
by liberating Spl from a putative Spl negative 
regulator factor Spl -I (Chen et ai, 1994; 
Murata et al., 1994). Thus, an emerging theme 
in transcriptional control is that regulation 
can result from functional interactions 
between transcription factors. Functional in- 
teractions between Sp proteins and the cell- 
cycle regulators RB and pl07 indicate that 
such cooperation may occur on promoters 
containing Sp-binding sites. Interestingly, both 
RB and pi 07 proteins are intimately involved 
in the control of cell-cycle regulated transcrip- 
tion (Weinberg, 1995) and several cell cycle- 
regulated genes, including the c-myc, dhfr, 
cyclinE, cdk2 and E2F-1, contain Sp-binding 
sites in their regulatory regions (Azizkhan et 
ai, 1993). The Spl/Sp3-pRB family inter- 
actions may represent a fine regulation of 
transcription by utilizing a multiplicity of 
possible interactions. 
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Abstract 

GC-boxes and related motifs are frequently occurring DNA-elements present in many promoters and enhancers. In contrast 
to other elements it was generally thought that the transcription factor Spl is the only factor acting through these motifs. The 
cloning of paralogous genes of the Spl factor uncovered the existence of a small protein family consisting of Spl, Sp2, Sp3 and 
Sp4. All four proteins exhibit very similar structural features. They contain a highly conserved DNA-binding domain composed 
of three zinc fingers close the C-terminus and serine/threonine- and glutamine-rich domains in their N-terminal regions. The high 
degree of structural conservation between these four proteins suggested that they do exert similar functions. Molecular, genetic 
and biochemical analyses, however, demonstrated that Sp2, Sp3 and Sp4 are not simply functional equivalents of Spl. Here, I 
will summarize and discuss recent advances which have been made towards understanding the mode of action and biological 
function of individual family members. © 1999 Elsevier Science B.V. All rights reserved. 

Keywords: Review; Spl; Sp2; Sp3; Sp4; Zinc finger 



1. Introduction 

Transcriptional regulation is exerted by the combina- 
torial action of proteins binding to distinct promoter 
and enhancer elements. Usually a limited number of ex- 
acting DNA elements is recognized not only by a single 
transcription factor but by a set of different proteins 
which are often structurally related (Latchman, 1995). 
Important and widely distributed promoter elements are 
G-rich elements such as the GC-box (GGGGCGGGG) 
and the related GT/CACCC-box (GGTGTGGGG). 
These elements are required for the appropriate expres- 
sion of many ubiquitous, tissue-specific and viral genes. 
In addition, they occur frequently in the regulatory 
region of genes which are under a specific mode of 
control such as cell cycle regulation, hormonal activation 
and developmental patterning. 

For some time it has been known that the general 
transcription factor Spl (Specificity protein 1) can bind 
to and act through the GC-boxes and it was generally 
accepted that this protein is an extremely versatile protein 
involved in the expression of many different genes docu- 
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mented by more than 2600 citations. More recently, 
however, it became clear that Spl is not the only protein 
acting through 'Spl -binding sites' but simply represents 
the first identified and cloned protein of a small protein 
family. Currently this family consists of four proteins 
designated Spl, Sp2, Sp3 and Sp4. Accordingly our view 
on Spl and its function has changed significantly. Here, 
I will summarize and discuss advances which have been 
directed towards understanding the properties and func- 
tion of the individual Sp-proteins. 



2. Molecular cloning of Sp transcription factors 

Spl was originally identified as the transcription 
factor which binds to and activates transcription from 
multiple GC-boxes in the simian virus 40 (SV40) early 
promoter (Dynan and Tjian, 1983; Gidoni et al., 1984) 
and the thymidine kinase (TK) promoter (Jones et al., 
1985). Molecular cloning of a partial human Spl cDNA 
from HeLa cells was described in 1987 (Kadonaga et al., 
1987). Since that time, the cDNA sequence published 
and deposited in databases is still incomplete. It encodes 
for the 696 C- terminal amino acids of human Spl. The 
entire human Spl protein consists of 778 amino acids. 
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Table 1 

Function and properties of Sp-factors 



Spl Human: J03133; 

Mouse: AF062566, 
AF022363; Rat: 
D12768 



Human: X68560, 
S52144; Mouse: 
AF062567 



Human: X68561, 
S50516; Mouse: 
U62522; Rat: 
U07610 



Human: 12ql3 
(Gaynor et al., 
1993; Matera and 
Ward, 1993); 
Mouse: 15 (Saffer 
et al., 1990); Rat: 
7q36 (Scohy et al., 
1998) 



Human: 

17q21.3-q22 (Scohy 
et al., 1998); Rat: 
10q31-q32.1 (Scohy 
et al., 1998); 
Human: 2q31 
(Kalff-Suske et al., 
1996); Rat: 
3q24-q31 (Scohy 
et al., 1998) 



Human: 7pl5.3-p21 
(Kalff-Suske et al., 
1995); Mouse: 12 
(Supp et al., 1996); 
Rat: 6q33 (Scohy 
et al„ 1998) 



Ubiquitous, 
developmental 
variations (Saffer 
etal., 1991) 



Various cell lines; 
tissues unknown 
(Kingsley and 
Winoto, 1992) 

Ubiquitous (Hagen 
et al., 1992) and G. 
Suske, unpublished 



Transcriptional 
properties 

Activator (Courey 
and Tjian, 1988) 
Synergistic 
activation (Courey 
et al., 1989) ; 
Superactivation 
(Pascal and Tjian, 
1991) 



Repressor of 
Spl -mediated 
transcription 
(Hagen et al., 1994) 
Activator (Udvadia 
et al, 1995; Dennig 
et al., 1996) 



Predominantly in 
neuronal cells; also 
in certain epithelia 
(Hagen et al., 1992; 
Supp et al., 1996) 
and G. Suske, 
unpublished 



Activator (Hagen 
et al., 1994, 1995) 



Knockout Features/ 
phenotype miscellaneous 

Lethal at embryonic Two glutamine-rich 
day 10 (Marin activation domains 

et al., 1997) (Courey and Tjian, 

1988; Gill et al., 

1994); 

phosphorylated 
(Jackson et al., 
1990); glycosylated 
(Jackson and Tjian, 
1988) 

Unknown Original Sp2 clone 

incomplete; 
complete sequence 
in data bank 
D28588 

Unknown Two glutamine-rich 

activation domains; 
three isoforms 
(Hagen et al„ 1994; 
Kennett et al., 
1997); inhibitory 
domain (Dennig 
et al., 1996); 
translational start 
site of full length 
protein unknown 
Growth retardation; Two glutamine-rich 
males do not breed activation domains 



(Supp et al., 1996) 



(Hagen et al., 
1995); entire human 
genomic sequence in 
data bank: 
Accession No. 
AC004595 



Its sequence could be obtained from the Tjian laboratory 
upon request. More recently, highly conserved full length 
Spl cDNAs from rat and mouse have been cloned 
(Imataka et al., 1992; Yajima et al., 1998) (Table 1). 

The Spl -related transcription factors Sp3 and Sp4 
(originally also designated SPR-2 and SPR-1, respec- 
tively) were cloned by recognition site screening using 
the GT-box motif of the uteroglobin promoter as a 
probe (Hagen et al., 1992). Independently, Sp3 was 
obtained along with Sp2 from a human T-cell library 
by low stringency screening with the zinc finger-encoding 
region of human Spl as a probe (Kingsley and Winoto, 
1992). The originally published human Sp2 cDNA 
sequence assigned as complete coding sequence turned 
out to be incomplete. The complete coding sequence of 
Sp2 is deposited in the data bank as human mRNA for 
the KIAA0048 gene (Accession number D28588). The 
cDNAs coding for the murine homologues of Sp3 and 
Sp4 have been published recently (Supp et al., 1996; 
Yajima et al., 1998). 



3. Structural features of Sp-family members 

All four human Sp-family members have similar 
domain structures (Fig. 1A). They contain three zinc 
fingers close to the C-terminus and glutamine-rich 
domains adjacent to serine/threonine stretches in their 
N-terminal region. The 81 amino acids C2H2-type zinc 
finger region which represents the DNA-binding domain 
is the most highly conserved part of the proteins. 
Alignment of that region shows that Spl, Sp3 and Sp4 
are more closely related to each other than to Sp2 
(Fig. IB). According to structural studies on zinc finger 
proteins bound to DNA (Pavletich and Pabo, 1991; 
Fairall et al., 1993), one could predict that the amino 
acids KHA within the first, RER within the second and 
RHK within the third zinc finger contact specific bases. 
Especially these critical amino acids are all conserved in 
Spl, Sp3 and Sp4 but not in Sp2. Consistently, Spl, 
Sp3 and Sp4 recognize the classical Spl -binding site 
with identical affinity (Hagen et al., 1992, 1994). In Sp2, 
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Fig. 1 . Structural features of Sp-proteins. (A) Schematic representation of the four human Sp-family members Spl , Sp2, Sp3 and Sp4. Their length 
in amino acids is indicated on the right. The length of Sp3 refers to the full length isoform according to (Kingsley and Winoto, 1992). Colored 
boxes indicate regions of the proteins which are rich in glutamine (red) and serine/threonine (yellow) residues. The region preceding the first zinc 
finger (+/-) is rich in charged amino acids. The black boxes represent the zinc fingers. Lines above the draw of the Spl protein indicate the extent 
of four regions (A, B, C and D) which contribute to the transcriptional properties of Spl as defined by Tjian and coworkers (Courey and Tjian, 
1988). Known activation (AD) and inhibitory domains (ID) are indicated. (B) Protein sequence alignment of the zinc finger domains. Stars below 
the sequence indicate sequence identity. Cysteine and histidine residues which coordinate zinc ions are underlayed in green and protein regions 
which contact the DNA in blue. Arrows point to the amino acids which determine the recognition specificity by contacting specific bases of the 
DNA (adapted from Pavletich and Pabo, 1991; Fairall et al., 1993). The black lines and the zig-zag lines indicate cx-helical and p-sheet regions 
respectively. 



the important histidine residue within the first zinc finger 
is replaced by a leucine residue (Fig. IB). In accordance 
with this structural difference it was shown that Sp2 
does not bind to the GC-box but to a GT-rich element 
within the T-cell receptor gene 5 '-flanking region 
(Kingsley and Winoto, 1992). 



4. Evolutionary relationship of the four Sp genes 

The structural similarity of the four Sp-proteins sug- 
gests that they are evolutionally closely related. This is 
indeed the case and documented by their chromosomal 
localization in the human genome. All four Sp genes 
are found on paralogous chromosomal regions on 
human chromosomes 12ql3 (Spl), 17q21.3-q22 (Sp2), 
2q31 (Sp3) and 7q21.3-q22 (Sp4) (Matera and Ward, 
1993; Kalff-Suske et al., 1995, 1996; Scohy et al., 1998). 
The human Sp genes are linked to the homeobox gene 
cluster (O'Brien et al., 1993) on the corresponding 
chromosome (Spl/HOX C, Sp2/HOX B, Sp3/HOX D 
and Sp4/HOX A). This chromosome configuration is 
maintained in rodents with the exception of Sp4 which 
dissociated from the Hox a genes in the mouse and in 
the rat (Saffer et al, 1990; Supp et al., 1996; Scohy 
et al., 1998). 

Except for the human Sp4 gene, the sequence and 



the exon-intron structures of the individual chromo- 
somal Sp genes are only partially known (Fig. 2). The 
entire sequence of the human Sp4 gene is contained 
within the BAC clone RG023M10 (Accession No. 
AC004595). Cloning and mapping of the entire murine 
Sp4 gene (G. Suske, unpublished data) revealed that the 
exon-intron structure of the Sp4 gene is conserved 
between mouse and man. Partial gene structure informa- 
tion of the mouse Spl (Chestier and Charnay, 1992; 
Marin et al., 1997) and Sp3 genes (G. Suske, unpub- 
lished data) are also known (Fig. 2). The available data 
show that the exon-intron structures are conserved 
among individual Sp-family members supporting the 
conclusion that the four genes arise from a single 
ancestor gene. 

Little is known about Sp genes in non-mammalian 
vertebrates and invertebrates. In vitro binding studies 
and biochemical characterizations have demonstrated 
the existence of GC-box binding proteins in fish which 
share immunological properties with Spl and Sp3 
(Baudler et al., 1997). In Drosophila melanogaster, the 
head-specific protein buttonhead (btd) encodes a zinc 
finger type transcription factor with significant sequence 
and functional similarity to the Sp-proteins (Wimmer 
et al., 1993). However, the arrangement of the zinc 
finger region relative to the other domains is slighly 
different in btd. The region C-terminal to the zinc finger 



294 



G. Suske I Gene 238 (1999) 291-300 



| mSp3 gene | 




\hSp4gene | ^^T) 



- 83 017 Kb- 



Fig. 2. Known genomic structures of Spl, Sp3 and Sp4 genes substantiate their close evolutional relationship. The central drawing shows the 
structure of the Sp-proteins schematically. The glutamine-rich activation domains A and B and the zinc fingers are indicated. Known exons 
(numbered 1 to 6 in the case of the Sp4 genes) of the mouse Spl, Sp3 and Sp4 genes are connected with the corresponding protein regions. Both 
glutamine-rich activation domains are encoded on a single large exon. The human Sp4 gene contains a CpG island in its 5'-flanking region. 



region is extended and contains a serine/threonine-rich 
domain. Btd is required for development of the antennal, 
intercalary and mandibular segments of the fly head. A 
human Spl transgene rescued the btd null phenotype 
partially. It supported development of the mandibular 
segment in the head of btd mutant embryos when 
expressed in the spatial pattern of btd (Wimmer et al., 
1993). It would be interesting to know whether any of 
the other three Sp-proteins would have the capacity to 
rescue fully the btd null phenotype in flies. Since Sp4 is 
expressed predominantly in the brain, it could be a good 
candidate gene. 



5. Functional properties of individual Sp-family members 

5.1. Spl: The prototype of the family 

Molecular cloning of Spl and its subsequent dissec- 
tion revealed the functional domains of the protein. 
Both glutamine-rich regions (designated A and B) 
(Fig. 1A) can act as strong activation domains (Courey 
and Tjian, 1988). Mapping of the activation domain 
revealed that interspersed bulky hydrophobic amino 
acids are essential for transcriptional activation and not, 
per se, the glutamine residues (Gill et al., 1994). More 
recently, an inhibitory domain has been mapped to the 
N-terminus (Murata et al., 1994). Synergistic activation 
of promoters by Spl through multiple GC-boxes 
requires in addition the short C-terminal domain D 
(Pascal and Tjian, 1991 ). Spl is known to be phosphoryl- 
ated (Jackson et al., 1990) and glycosylated (Jackson 
and Tjian, 1988), and it is capable of forming homotypic 
interactions leading to multimeric complexes 
(Mastrangelo et al., 1991; Pascal and Tjian, 1991; Su 



et al., 1991). In addition, many heterotypic interactions 
with different classes of nuclear proteins have been 
reported. These include factors belonging to the general 
transcription machinery, such as the TATA-box binding 
protein TBP (Emili et al., 1994) and the TBP-associated 
factors dTAFIIHO/hTAFIIBO (Hoey et al., 1993; 
Tanese et al., 1996), and hTAFII55 (Chiang and Roeder, 

1995) . Other proteins which have been shown to interact 
with Spl are cell cycle regulators such as the retinoblas- 
toma-related protein pl07 (Datta et al., 1995) and 
transcription factors such as YY1 (Lee et al., 1993; Seto 
et al., 1993) or E2F (Karlseder et al„ 1996; Lin et al., 

1996) . Spl can bind to its target sequence in assembled 
nucleosomes (Li et al., 1994), and in this regard it is 
interesting to note the interaction with a large coactiva- 
tor complex called CRSP (cofactor required for Spl 
activation) which stimulates Spl -mediated transcription 
in vitro (Ryu et al., 1999). 

5.2. Sp2 

There exists no functional analysis of Sp2 and corre- 
spondingly little is known about this Sp-family member. 
The presence of a potential glutamine-rich activation 
domain suggests that it may act as an activator. The 
T-cell antigen receptor a (TCRa) gene might be a target 
for Sp2 since it binds in vitro to a GT-box promoter 
element within the TCRa promoter (Kingsley and 
Winoto, 1992). 

5.3. Sp3: Activator versus repressor 

Unraveling the transcriptional role of Sp3 was com- 
plicated by the fact that three Sp3 isoforms exist, a 
110-115 kDa Sp3 protein and two approximately 
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60-70 kDa Sp3 species. Very likely, the full length Sp3 
protein is derived by translational initiation at a non- 
AUG which has been arbitrarily assigned to an AUU 
codon (Kingsley and Winoto, 1992). However, other 
potential non-AUG start codons are also present in that 
region and thus it remains to determine the N-terminus 
of Sp3. It has also been suggested that the published 
Sp3 cDNA lacks 5 '-untranslated sequences and the 
extreme N-terminus of the Sp3 protein. I consider this 
possibility unlikely since in vitro translation of the full 
length isoform of Sp3 from the published mRNA has 
been described (Kingsley and Winoto, 1992; Kennett 
et al., 1997). The two smaller Sp3 species arise from the 
first two internal AUG codons (G. Suske, unpublished 
observation, and Kennett et al., 1997). Consistently, 
antibodies raised against the N-terminus of Sp3 recog- 
nize in EMSAs the slow migrating complex which 
contains the full length Sp3 isoform but not the two 
fast migrating complexes which contain the two 
N-terminally truncated isoforms (Fig. 3). 

Reports on the transcriptional properties of Sp3 
appear, at first sight, contradictory. Sp3 has been shown 
to act as a transcriptional activator similar to Spl 
(Udvadia et al, 1995; Liang et al., 1996; Ihn and 
Trojanowska, 1997; Zhao and Chang, 1997, and many 
others). In other experiments, Sp3 remained inactive or 
acted only as a very weak activator (Hagen et al, 1994; 
Majello et al, 1994; Dennig et al, 1995; Kumar and 
Butler, 1997 and others). Most of these reports are 
based on co-transfection experiments into the insect cell 
line SL2. Usually, a distinct promoter fragment contain- 
ing appropriate Sp-binding sites fused to a reporter gene 
was co-transfected along with Spl and Sp3 or both 
together. If Sp3 is expressed to the same extent as Spl 
but does not act as a strong activator, it will compete 
for the same binding site and thus lower Spl -mediated 
activation. There are only a few reports in which addi- 
tional approaches have been chosen to determine the 
potential role of Sp3 for the activity of a certain pro- 
moter (Noti, 1997; Hata et al, 1998). An antisense 
strategy which knocked out endogenous Sp3 expression 
in the myelomonocytic cell line HL60 revealed that Sp3 
participates in the activation of the CDllc and CDllb 
promoters (Noti, 1997). 

The experimental conditions which are needed for 
Sp3 to act as a strong activator or a transcriptional 
inactive molecule which represses Spl -mediated activa- 
tion are not completely understood. The structure and 
the arrangement of the recognition sites appear to 
determine whether Sp3 is transcriptionally inactive and 
can repress Spl -mediated activation or whether it acts 
as a strong activator. Promoters containing a single 
binding site are activated, whereas promoters containing 
multiple binding sites often are not activated or respond 
weakly to Sp3 (Birnbaum et al, 1995; Dennig et al, 
1996). Whether Sp3 acts as an activator or as a repressor 



N -l A 



IP ■ ■ IIl-c 



oSp3N 



ccSp3 




1 2 3 4 5 6 7 



Fig. 3. Spl and the three isoforms of Sp3 are visible in an electropho- 
retic mobility shift assay (EMSA). A schematic representation of Sp3 
is shown at the top. The activation domains A and B, the inhibitory 
domain (ID), the charged region (+/-) preceding the DNA-binding 
domain and the three zinc fingers (black boxes) are depicted. Asterisks 
indicate the N-termini of the three isoforms. The black lines indicate 
the segments of Sp3 which have been used for generation of the anti- 
bodies aSp3N and aSp3. The EMSA was performed with a 
32 P-labeled GC-box oligonucleotide and nuclear extract from CV-1 
cells. Antisera against Spl (aSpl, lane 3), the N-terminal part of Sp3 
(aSp3N, lane 5), the C-terminal part of Sp3 (txSp3, lane 2) or the 
corresponding preimmune sera (Pre, lanes 1, 4 and 6), respectively, 
were included in the binding reactions as indicated. Complexes contain- 
ing Spl or Sp3 are indicated at the left. The antiserum directed to the 
C-terminus shifted all three Sp3-containing complexes (lane 2) whereas 
the antiserum directed to the N-terminus shifted only the slow migrat- 
ing complex (lane 5). 

of Spl -mediated activation might also depend of the 
cellular context. Transfected Sp3 stimulated transcrip- 
tion from the HERV-H long-terminal repeat in the 
teratocarcinoma cell line NTera2-Dl but acted as a 
repressor in HeLa and insect cells (Sjottem et al, 1996). 

It has been suggested that the two small Sp3 isoforms 
might act as repressor molecules whereas the full length 
Sp3 isoform does act as an activator (Kennett et al, 
1997). Although attractive and simple, this model does 
not seem to hold true. Exclusive expression of full length 
Sp3 triggered by an artificial leader sequence can also 
repress Spl-mediated activation (Dennig et al, 1996). 



296 



G. Suske I Gene 238 (1999) 291-300 



It is clear that both N-terminal glutamine-rich regions 
can act as strong activation domains on their own in 
both insect and in mammalian cells (Dennig et al., 1996; 
Majello et al., 1997). The molecular basis for the inactiv- 
ity of Sp3 under certain conditions has been mapped to 
an inhibitory domain located between the second gluta- 
mine-rich activation domain and the first zinc finger. 
The amino acid triplet KEE within this domain is 
absolutely essential for repressor function (Dennig et al., 
1996). Mutation of these amino acids to alanine residues 
converted almost inactive Sp3 to a strong activator. The 
inhibitory domain of Sp3 acts as an independent modul 
in cis. It can be transferred to other activation domains 
which in turn lose their activation properties (Dennig 
et al., 1996). Yet, we do not know how this domain 
functions mechanistically. Purified recombinant Sp3 
expressed in SL2 cells (Braun and Suske, 1999) act in 
an in vitro system as strong activator similar to Spl (H. 
Braun and G. Suske, unpublished). From this, and 
other observations, it can be concluded that additional 
proteins which act as co-repressors are involved in the 
inhibitory function of this domain. Our recent cloning 
of a protein designated SIF-1 (Sp3-interacting protein 
1) which specifically interacted with the wild type inhibi- 
tory domain but not with the mutated form support 
this idea (A. Doll and G. Suske, unpublished). 

5.4. Sp4: The tissue-specific Sp-family member 

Sp4 was cloned along with Sp3 by virtue of its 
binding to the GT-box of the uteroglobin promoter 
(Hagen et al., 1992). In contrast to Spl and Sp3 which 
are ubiquitous transcription factors, Sp4 expression 
appears to be restricted to a few tissues. High levels of 
Sp4 are predominantly found in the brain (Hagen et al., 
1992; Supp et al., 1996). Consistent with its high expres- 
sion in the central nervous system, targeted disruption 
of the mouse Sp4 gene by homologous recombination 
led to behavioral defects (see Section 7). 

Sp4 exhibits specific functional properties distinct 
from Spl and Sp3. The transactivation function of Sp4 
resides, like that of Spl, in the N-terminal glutamine- 
rich regions. In contrast to Spl, Sp4 is not able to act 
synergistically through adjacent binding sites although 
Sp4 can function as a target for the Spl activation 
domains in a superactivation assay suggesting that the 
activation domains of Spl and Sp4 are functionally 
related (Hagen et al., 1995). 



6. Regulation by the ratio of Spl and Sp3 

It is clear that in a given cell type, co-expression of 
Spl and Sp3 occurs and we have to assume that these 
two proteins compete for the same binding sites in vivo. 
The initial characterization of Spl and Sp3 shows that 



they differ in their capacity to activate or repress tran- 
scription. Independently of whether Sp3 acts as an 
activator or as a repressor of Spl -mediated activation, 
the relative abundance of Spl and Sp3 should allow 
regulation of gene activities. The abundance of Spl and 
Sp3 varies among different cell types. In endothelial cells 
which contain high levels of both Spl and Sp3, the 
Spl/Sp3 ratio is higher than in non-endothelial cells 
(Hata et al., 1998). The endothelial-specific activity 
of the KDR/flk-1 promoter was mapped to an 
Spl/Sp3-binding site, and Sp3 attenuated Spl-mediated 
KDR/flk-1 promoter activation. Thus, it was suggested 
that endothelial selective KDR/flk-1 expression may be 
partially mediated by the high Spl/Sp3 ratio in these 
cells (Hata et al., 1998). 

Alterations in the relative abundance of Spl and Sp3 
upon different conditions have been reported in several 
cases. In primary keratinocytes, Sp3 levels exceed those 
of Spl. The Sp3/Spl ratio beomes inverted if these cells 
differentiate in vitro on treatment with high calcium 
suggesting that cell type differential transcription of 
several genes might be regulated by Spl and Sp3 (Apt 
et al., 1996). Worth mentioning in this context is that 
in differentiated keratinocytes only Sp3, but not Spl, 
overexpression enhanced the induction of the p21 pro- 
moter (Prowse et al., 1997). 

A change in the Spl/Sp3 ratio occurred also when 
C2C12 myocytes were cultivated under hypoxic condi- 
tions. Hypoxia caused a progressive depletion of Sp3 
whereas the Spl protein level remained unchanged 
(Discher et al., 1998). Like other glycolytic enzymes the 
muscle-specific pyruvate kinase-M and p-enolase are 
upregulated under low oxygen pressure. The hypoxia 
responsive promoter elements in these two genes have 
been mapped to GC-rich elements bound by Spl and 
Sp3. Thus, it was concluded that hypoxia activates these 
glycolytic enzyme gene promoters by down-regulating 
Sp3, thereby removing the associated transcriptional 
repression (Discher et al., 1998). 

In another report, it was shown that downregulation 
of a2-integrin gene expression in mammary epithelial 
cells by Erb-B2 and v-Hras is mediated by two adjacent 
Spl/Sp3 binding sites in the a2-integrin promoter. In 
that case, however, the Sp3 DNA-binding activity 
remained unaltered whereas the Spl DNA-binding activ- 
ity was reduced (Ye et al., 1996). 

Although not shown so far, one could imagine that 
alterations of the transcriptional capacity of Spl or Sp3 
could be obtained by other mechanisms. Especially, Sp3 
could be a target of signal transduction pathways since 
it has both activation and repression functions. 
Modifications in the protein might alter the potency of 
the protein in one or the other direction. Altogether, 
further unraveling of the molecular structure and modi- 
fications of the protein should give insights into the 
specific role of Sp3. 
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7. Physiological function of Sp-proteins 

In the past, a large variety of biological functions 
have been assigned to Spl -binding sites and to Spl. 
However, the identification of the three paralogous 
proteins Sp2, Sp3 and Sp4 raises the question as to 
which tasks are performed by which protein. This ques- 
tion is particularly interesting for Spl and Sp3 because 
both proteins are present in the same cell and are 
indistinguishable in their DNA-binding specificity. Gene 
disruption in mice is a powerful tool for obtaining 
information on specific functions of individual 
Sp-proteins. 

Given that Spl is implicated in the activation of a 
very large number of genes, such as housekeeping, 
tissue-specific and cell cycle-regulated genes, and is 
required to prevent methylation of CpG islands 
(Brandeis et al., 1994; Macleod et al., 1994), one would 
expect that cells lacking Spl would not survive. 
Surprisingly, this is not the case. S. Philipsen and his 
coworkers have disrupted the mouse Spl gene and found 
that Spl -deficient embryonic stem cells (ES cells) are 
viable, have normal growth characteristics and can be 
induced to differentiate and form embryoid bodies as 
efficiently as wild type ES cells (Marin et al., 1997). 
Nevertheless, Spl is essential for normal mouse embryo- 
genesis. The iS/?/-knockout embryos are severely 
retarded in development and they all died around day 
1 1 of gestation. They displayed a marked heterogeneity 
in phenotype indicating that Spl has indeed a general 
function in many cell types. Interestingly, the defects in 
Spl-/- mice are caused by a cell autonomous mechanism. 
Spl-/- ES cells injected into blastocysts contributed 
efficiently to chimaeric embryos at early stages but after 
day E 1 1 they rapidly declined with no contribution to 
newborn mice. Thus, Spl appears to be a transcription 
factor whose function is essential for differentiated cells 
after day 10 of development. 

Independently of the severe developmental defects of 
the Spl null mice, the embryos express many putative 
Spl target genes at normal levels, including housekeep- 
ing and cell-cycle regulated genes. In addition, CpG- 
islands remained methylation free. So far, the only genes 
which were found to be expressed at a lower level in 
Spl-/- mice are the thymidine kinase and the methyl- 
CpG binding protein 2 (MeCP2) genes (Marin et al., 
1997). Since there are striking similarities between the 
phenotypes of the MeCP2 (Tate et al., 1996) and the 
Spl knockouts, it was suggested that the MeCP2 gene 
might be a key target of Spl. However, whether Spl 
acts as a direct regulator of MeCP2 expression by 
binding to promoter, enhancer or local control region 
elements in the MeCP2 gene, or whether additional 
proteins mediate downregulation of MeCP2 remains to 
be established. 

The Spl -knockout demonstrated that Spl is not 



essential for the expression of many genes previously 
shown be activated in cell culture transfection experi- 
ments. One could speculate that other Sp-family mem- 
bers compensate at early embryonic stages, at least in 
part for the loss of Spl activity. Sp3 would be a good 
candidate because it is also ubiquitously expressed, has 
the potential to activate transcription and its DNA- 
binding activity is indistinguishable from Spl. In that 
regard, the phenotype of Sp3 knockout mice will be of 
great interest. 

Sp4, the tissue restricted member of the Sp-family, is 
predominantly expressed in the brain but also detectable 
in epithelial tissues, testis and developing teeth (Hagen 
et al., 1992; Supp et al., 1996). Disruption of the mouse 
Sp4 gene revealed that it is important for early post- 
natal survival (Supp et al., 1996). Approximately two 
thirds of the Sp4-/- mice die within a few days of birth. 
The cause of the early death remains unknown. Those 
mice which survive are significantly smaller than their 
wild type littermates. The reduced body weight appears 
to result from an unknown, but growth hormone inde- 
pendent, mechanism (Supp et al., 1996). Interestingly, 
surviving mice exhibit a striking sex-specific abnormal- 
ity. While fertility of the female mutants appears normal, 
males do not breed although their reproductive organs 
are fully developed and apparently normal. It appears 
that male Sp4-/- mice are unable to copulate. The most 
likely cause of this abnormal behavior is a neurological 
defect. The hypothalamus and the vomeronasal organ 
are known to play important roles in reproductive 
physiology and behavior. However, both structures are 
histologically normal in Sp4-/- mice. Thus, we await 
further investigation to understand the role of Sp4 and 
to identify its target genes. 

In that context, it should be noted that for both the 
Spl and the Sp4 knockouts, the zinc finger regions have 
been chosen to inactivate the genes. In both cases the 
N-terminal part encoding the transactivation domains 
are still expressed (Supp et al., 1996; Marin et al., 1997). 
One could speculate that the activation domains on 
their own might act as a gain-of-function or might 
interfere with other Sp-family members. In the case of 
the Sp4 knockout such a scenario does not seem to be 
the case. Knockout Sp4-/- mice which do not express 
the N-terminal part of the protein manifest the same 
phenotype (G. Suske, unpublished observations). 



8. Yet more and more GC/GT-box binding proteins 

In the past, essential GC-boxes in promoters were 
often equated with "Spl-binding sites' thereby overlook- 
ing the fact that Spl is not the only protein which 
recognizes this important element. In addition to Sp3 
and Sp4 there exist at least three other proteins, BTEB1 
(basic transcription element binding protein 1) (Imataka 
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et al., 1992), TIEG1 and TIEG2 (TGFp-inducible early 
protein genes 1 and 2) (Subramaniam et al., 1995; Cook 
et al., 1998; Fautsch et al., 1998) which have a binding 
specificity very similar, if not identical, to Spl (Sogawa 
et al., 1993b; Cook et al., 1998). In BTEB1, TIEG1 and 
TIEG2 the DNA-binding domain is found also close to 
the C-terminus and consists of three C2H2-type zinc 
fingers. Most significantly, all amino acids which are 
believed to specifically contact the DNA are conserved 
between these three proteins and Spl, Sp3 and Sp4. 
However, the N-terminal domains of these proteins do 
not share any similarity with the Sp-proteins. BTEB1 
contains acidic and hydrophobic amino acid stretches, 
and TIEG1 as well as the closely related protein TIEG2 
are rich in proline residues. 

In addition, another subgroup of zinc finger proteins, 
including the so called Krtippel-like factors, has to be 
considered when essential 'Spl -binding sites' are ana- 
lyzed in promoters. These proteins also contain three 
zinc fingers close to their C-terminus and bind preferen- 
tially to so called GT-or CACCC-boxes. Sp-family mem- 
bers also recognize these elements. Thus, Sp-proteins 
and Kriippel-like factors have an overlapping binding 
specificity. Currently this subfamily of zinc finger pro- 
teins consists of at least 10 different members, the so 
called Kruppel like factors EKLF (Miller and Bieker, 
1993), BKLF/TEF-2 (Crossley et al., 1994), GKLF 
(Shields et al., 1996), IKLF (Conkright et al., 1999), 
LKLF (Anderson et al., 1995), UKLF (Matsumoto 
et al., 1998) and FKLF (Asano et al., 1999) and 
ZNF741 (only sequence in data base), AP-2rep (Imhof 
et al., 1999) and ZF9 (Ratziu et al., 1998). Another 
factor called BTEB2 (Sogawa et al., 1993a) is very likely 
an N-terminally truncated human ortholog of murine 
IKLF. Understanding the biological function and the 
mode of action of all these GC/GT-box binding proteins 
will be a scientific challenge. 



9. Conclusions 

The cloning and initial characterization of a family 
of Sp-proteins as well as other GC/GT-box binding 
proteins provides much information about the potential 
functions and activities of these proteins. The most 
obvious question, however, concerns the specificity of 
the individual family members especially of Spl and 
Sp3. Neither binding site preferences nor differential 
expression patterns seem to confer specificity of these 
two proteins. However, activation of a given promoter 
requires the binding of multiple transcription factors 
which might bind cooperatively to their sites or might 
act synergistically by other mechanisms. Thus far, little 
is known how Spl and Sp3 act on natural promoters in 
combination with other transcription factors in vivo. 
Specificity could be obtained also by the interaction 



with co-activators or co-repressors. In that regard it 
might be interesting to know whether the co-activator 
complex CRSP (Ryu et al., 1999) is specific for Spl or 
whether it can also cooperate with other Sp-family 
members. 

The most interesting question at this stage concerns 
the key target genes for Spl and Sp3. The Spl-knockout 
suggests that there are only a few genes which are 
regulated by Spl. However, only a few genes have 
actually been tested. New techniques such as DNA 
microchip arrays could be helpful in identifying addi- 
tional genes whose expression is dependent on Spl. On 
the other hand, knocking out a single family member 
may not reflect the whole truth because overlapping 
functions with other Sp-family members might conceal 
important in vivo functions. In addition, the Spl -knock- 
out mice do not provide information on Spl in 
differentiated tissues in the adult animal. For that we 
have to await for the conditional disruption of the Spl 
gene in specific tissues and at specific stages during 
development. 
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Cancer chemotherapy encompasses a large number of 
well-documented and clinically established methods 
for the treatment of malignant diseases. However, the 
efficacy of these approaches is often hampered by an 
insufficient therapeutic index, lack of specificity, and 
the emergence of drug-resistant cell subpopulations. 
One approach aimed at enhancing the selectivity of 
cancer chemotherapy for solid tumors relies on the 
application of gene therapy technologies. 

Gene therapies are techniques for modifying the cel- 
lular genome for therapeutic benefit. In cancer gene 
therapy, both malignant and nonmalignant cells may 
be suitable targets. The possibility of rendering cancer 
cells more sensitive to chemotherapeutics or toxins by 
introducing "suicide genes" was suggested in the late 
1980s. This approach has two alternatives: toxin gene 
therapy, in which the genes for toxic products are trans- 
fected directly into tumor cells; and enzyme-activating 
prodrug therapy, in which the transgenes encode 
enzymes that activate specific prodrugs to create toxic 
products. The latter approach, known as gene-directed 
enzyme prodrug therapy (GDEPT) (1, 2) or virus-direct- 
ed enzyme prodrug therapy (VDEPT) (3), may be used 
in isolation or combined with other strategies, such as 
the biotherapies described elsewhere in this Perspective 
series. VDEPT using selectively replicating viruses as 
vectors represents a promising means to target suicide 
genes specifically to tumor cells, an approach that is 
only beginning to be explored (for examples, see Her- 
miston, this Perspective series, ref. 4). 

GDEPT and VDEPT are two-step treatments for 
solid tumors (Figure 1). In the first step, the gene for 
a foreign enzyme is delivered and targeted in a vari- 
ety of ways to the tumor where it is to be expressed. 
In the second step, a prodrug is administered that is 
selectively activated to the drug by the foreign 
enzyme expressed in the tumor. Ideally, the gene for 
the enzyme should be expressed exclusively in the 
tumor cells and should reach a concentration suffi- 
cient to activate the prodrug for clinical benefit. The 
catalytic activity of the expressed protein must be 
adequate to activate the prodrug under physiologi- 
cal conditions. Because expression of the foreign 
enzymes will not occur in all cells of a targeted 
tumor in vivo, a bystander effect (BE) is required, 
whereby the prodrug is cleaved to an active drug that 
kills not only the tumor cells in which it is formed, 
but also neighboring tumor cells that do not express 
the foreign enzyme (5). 



The genes can be engineered to express their prod- 
ucts either intracellularly or extracellularly in the 
recipient cells (6). There are potential advantages to 
each approach. When the enzyme is intracellularly 
expressed, the prodrug must enter the cells for acti- 
vation, and subsequently the active drug must dif- 
fuse through the interstitium across the cell mem- 
brane to elicit a BE. Cells in which the enzyme is 
expressed (tethered to the outer surface) are able to 
activate the prodrug extracellularly. A more sub- 
stantial BE could therefore be generated with extra- 
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Figure 1 

GDEPT, a form of suicide gene therapy, aims to maximize the effect of a 
toxic drug and minimize its systemic effects by generating the drug in situ 
within the tumor. In the first step in this procedure, the gene for an exoge- 
nous enzyme is delivered and expressed in the tumor cells. Subsequently, 
a prodrug is administered that is converted to the active drug by the for- 
eign enzyme expressed inside or on the surface of tumor cells. 
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cellular gene product expression, but spread of the 
active drug into the general circulation is a possible 
disadvantage (1, 6). 

Parameters that influence the success of GDEPT systems 

Effective tumor destruction with GDEPT depends on 
the design of the gene- therapy vectors, the chemistry of 
the prodrugs and their toxic metabolites, and the 
means to deliver one or both components specifically 
to target cells. Vectors, the vehicles in which the trans- 
genes reach the tumor cell, must be carefully tailored 
to specific GDEPT systems. The specificity of targeting 
to cancer cells and efficient transfection are essential 
for effective GDEPT, as are the toxicity of the vector 
and the uptake of prodrugs or drugs by normal and 
malignant cells. GDEPT systems (7-10) require the 
design of tailored prodrugs and the use of a foreign 



enzyme — one whose activity is absent from the 
patient's tissues (or at least from the tissue to be treat- 
ed), and that can convert the prodrug to the drug in a 
specific manner. Beneficial immune effects may be 
induced either by stimulation of the host immune sys- 
tem or by the use of additional cytokine gene therapy 
(see article on immunomodulation by Agha-Moham- 
madi and Lotze, this Perspective series, ref. 11). The effi- 
ciency of the BE is another key determinant of the suc- 
cess of these systems. 

The enzymes proposed for GDEPT fall into two cate- 
gories. The first comprises foreign enzymes of non- 
mammalian origin, with or without human homo- 
logues. Examples are viral tyrosine kinase (TK), bacterial 
cytosine deaminase (CD), carboxypeptidase G2 (CPG2), 
purine nucleoside phosphorylase (PNP), and nitrore- 
ductase (NR). The second category consists of enzymes 
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of human origin that are absent or are expressed only at 
low concentrations in tumor cells, such as deoxycytidine 
kinase (dCK) and cytochrome P450. The human homo- 
logues of enzymes in the first category have different 
substrate structural requirements than the foreign 
enzymes have. Their main disadvantage as therapeutic 
agents may be the potential to elicit an immune response 
in humans, although this may actually provide benefits 
to therapy. Enzymes of the second category are unlikely 
to induce immune responses, but their use will in most 
cases lead to some prodrug activation in normal cells. 

The second step is administration of the prodrug. 
GDEPT's advantages over conventional prodrug ther- 
apy (its greater selectivity and higher drug concentra- 
tions) arise from the fact that the drug is generated in 
situ in the target tumor. Prodrugs for GDEPT must be 
considerably less cytotoxic than their corresponding 
active drugs, and must be suitable substrates for the 
activating enzyme under physiological conditions. In 
addition, they should be chemically stable under phys- 
iological conditions and diffuse readily in the tumor 
interstitium. They should also have good pharmaco- 
logical and pharmacokinetic properties, and release an 
active drug with a good BE. Prodrugs must also be tai- 
lored to their site of activation: if activation occurs 
intracellularly, they must be able to cross the tumor cell 
membrane; whereas if the enzyme is expressed on the 
surface of cells, there is no such requirement, but the 
active drug must be able to cross the cell membrane. 

The cytotoxicity differential between the prodrug and 
its corresponding drug should be as high as possible, 
and the active drug should be highly diffusible or be 
actively taken up or exported by cells. The design of a 
prodrug that can release a highly effective drug requires 
knowledge of the quantitative structure-activity rela- 
tionship (QSAR). For this reason, and in order to obtain 
proof of principle for GDEPT, most of the prodrugs 
used in suicide gene therapy to date have been clinical- 
ly licensed anticancer agents with known pharmaco- 
logical, pharmacokinetic, dosage, and safety parameters. 
However, it is likely that the next generation of prodrugs 
will be specially tailored for GDEPT. One possibility is 
the use of "self-immolative" prodrugs. 

A self-immolative prodrug can be defined as a com- 
pound that generates an unstable intermediate that 
then extrudes the active drug in subsequent steps. 
Although the activation process that generates the 
unstable species is generally enzyme-mediated, extru- 
sion occurs spontaneously through the fragmenta- 
tion of the prodrug, often at a distinct site. Self- 
immolative prodrugs allow their lipophilicity to be 
altered with minimal effect on the activation kinetics. 
Indeed, kinetics of activation that are unfavorable due 
to the chemical or steric features of the active drug 
can be improved by this approach. The range of drugs 
that can be converted to prodrugs is greatly extended, 
and is unrestricted by the structural substrate require- 
ments for a given enzyme. Figure 2 shows one self- 
immolative prodrug activation reaction, the activa- 
tion of the doxorubicin prodrug by the enzyme CPG2 
(12, 13), followed by the spontaneous extrusion of the 
DNA-damaging agent doxorubicin. 



Table 1 summarizes pharmacokinetic data on 1 1 
enzyme/prodrug pairs that have been designed for 
use in GDEPT systems. With the exception of 
cyclophosphamide, ifosfamide, and some prodrugs 
designed for CPG2 and NR, none of the prodrugs 
shown are self immolative. Self-immolative prodrugs 
derived from alkylating agents and anthracyclines 
have been synthesized for activation by CPG2 (12, 
13), and self-immolative derivatives from secocydo- 
propylindolines and ene-diyne prodrugs have been 
synthesized for use with NR (7). 

Also shown in Table 1 are two parameters that are 
useful in comparing the different GDEPT systems: the 
potential of activation of a given system and its degree 
of activation. The first parameter is defined as the ratio 
of ICso's of the prodrug and of the active drug in a non- 
transfected control tumor cell line. It represents the 
maximum possible efficiency of a given enzyme/pro- 
drug system toward that cell line. The degree of activa- 




Self immolation of prodrugl to yield the chemotherapeutic drug dox- 
orubicin. The doxorubicin prodrug (1) is cleaved bycarboxypeptidase G2 
(CPG2), releasing the glutamic acid (3) and an unstable intermediate (2). 
The latter undergoes a spontaneous 1,6 elimination, extruding doxoru- 
bicin (4), a quinone imine (5), and carbon dioxide. This scheme can be 
readily modified to allow the production of structurally similar drugs, 
such as daunorubicin. 
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tion is defined as the ratio of the IC50 of the prodrug in 
the nontransfected cell line to the IC 50 of the prodrug 
in the transfected or infected derivative of the cell line 
that expresses the activating enzyme. Both the poten- 
tial for activation and the degree of activation depend 
on the cell line's sensitivity to the drug, but the degree 
of activation also reflects the efficiency of the prodrug 
system in the context of that cell line; if the enzyme in 
the transfected line is sufficient to convert the prodrug 
immediately and quantitatively to the active form of 
the drug, the degree of activation is identical to the 
potential of activation for that GDEPT system. 

Improving activation kinetics 

The concentration of the drug and the rate at which 
it is released at the activation site depend on the kinet- 
ic parameters of the enzyme/prodrug system. Unfor- 
tunately, the published V m3x and K m values — which 
measure the maximum velocity of the activation reac- 
tion and the concentration of substrate needed to 
reach half this maximum velocity — are not sufficient 
to allow enzyme/prodrug systems to be compared. As 
a rule, however, low K m and high V max (or fe cac ) would 
be expected to be found in relatively effective systems, 
and the comparison of the yeast CD with bacterial CD 
bears out this prediction. As shown in Table 1, the 
yeast enzyme, which proved to be more effective than 
its bacterial counterpart in a GDEPT experiment, 
exhibits lower K m and higher V max values (14). Howev- 
er, the literature does not supply consistent values for 
the Vmax of these enzymes, because V maji has been 
determined under very different experimental condi- 
tions for the various systems and is expressed in dif- 
ferent ways, making direct comparisons impossible. 
Despite these caveats, it appears from the data in 
Table 1 that prodrugs such as CMDA (a substrate of 
CPG2), GCV (a substrate of HSV-TK), and CPT-11 (a 
substrate of CA) are superior to 5-FC (a substrate of 
CD) or 5'-FDUR (a substrate of TP), because the lat- 
ter two have high K m and low V max values. The 
turnover number, fe cat , provides additional informa- 
tion about the reaction rate, but the implications of 
this measure for tumor cell killing is unclear, because 
it is not yet known if sudden release of the active drug 
is more effective than slow, constant release, or if qui- 
escent and proliferating cells differ in their sensitivi- 
ty to drugs released at different rates. 

New techniques that are available to increase the effi- 
cacy of enzymes to activate prodrugs for GDEPT were 
recently reviewed (9). Some of these approaches build 
on crystallographic descriptions of the active site of 
the enzyme involved in the enzyme/prodrug system, 
which should permit the molecular modeling, and 
eventually the rational synthesis, of substrates that are 
well-suited for a GDEPT system. An alternative is to 
modify the active site of the enzyme by site-directed 
mutagenesis in order to increase its catalytic efficien- 
cy toward an existing substrate. Black et al. (15) 
applied these techniques to obtain mutants of HSV- 
TK with improved kinetic parameters for the prodrugs 
GCV and ACV. Similarly, Smith and colleagues (16) 
performed site-directed mutagenesis on carboxypepti- 



dase A to improve the efficiency of this enzyme toward 
specific substrates that, by design, are less prone to 
interfere with other human peptidases (16). 

Prodrugs may also be activated by a metabolic cascade 
involving the sequential action of several enzymes, for 
example the activation of GCV to GCV triphosphate 
(GCVTP) by three different kinases (HSV-TK, guanylate 
kinase, and nucleoside diphosphate kinase), acting in 
series. This approach requires the cotransfection of 
genes for each of the enzymes, but is expected to increase 
the overall yield of the desired final metabolite, the active 
drug. In the case of GCV, Blanche et al. recently claimed 
that the simultaneous transfection of these three genes 
allows cells to convert more than 90% of the prodrug to 
GCVTP (17). Likewise, the cotransfection of the genes 
for cytochrome P450 and P450 reductase significantly 
increases the conversion of CP to it toxic metabolites, 
and therefore improves the overall efficiency of the 
cytochrome P450/CP GDEPT system (18) (see Table 1). 

Interpreting potential of activation 
and degree of activation 

As discussed above, the potential of activation of a 
GDEPT system reflects its maximal theoretical effi- 
ciency, at least toward a specific cell type. Unfortunate- 
ly, not all the systems can be defined in this way, 
because multiple products may be released, and the 
toxicity of each of these metabolites may not be avail- 
able. Thus, although GCV is relatively nontoxic, its 
monophosphorylated derivative, GVCMP, is highly 
cytotoxic (19), so the potential of activation of this sys- 
tem cannot be calculated accurately from the known 
IC50 of the triphosphorylated derivative, GCVTP. Like- 
wise, for CP activated by cytochrome P450, a maximum 
differential of 20- to 25-fold is theoretically achievable 
based on the IC50 values of CP and its corresponding 
phosphoramide mustard. However, the degree of acti- 
vation obtained was found to be 100-fold (20), which 
suggests that the CP phosphoramide mustard is not 
the final active metabolite. 

The optimal approach to improving the activation 
potential of a system is to design prodrugs with lower 
cytotoxicity, but the complementary strategies of 
increasing the cytotoxicity or improving the efficiency 
of release of the drug are also helpful. Some highly cyto- 
toxic compounds (with ICso in the nM range) such as 
ene-diynes, cyclopropylindolines, and taxoids are now 
available, but generally their structures are complicated 
and efficient ways are needed to convert them to low- 
cytotoxicity prodrugs. Designing self-immolative ana- 
logues of these prodrugs could be a way to move for- 
ward, as could modifications that improve the uptake 
of the prodrug by enzymatic modifications (21) or alter 
the lipophilicity of the prodrug (22). The latter capabil- 
ity is especially useful for tailoring the prodrug for use 
with an extracellular or intracellular activating enzyme. 

The degree of activation, which reflects the efficiency 
of the system, is another parameter useful for charac- 
terizing a GDEPT system. By definition, it must be 
lower than or at least equal to the potential of activation 
for the system, as is seen for all the systems analyzed in 
Table 1. The interpretation of both of these parameters 



1164 



The Journal of Clinical Investigation | May 2000 | Volume 105 | Number 9 



is complex. Their values offer some insight into the in 
vitro situation, where a single cell type, transfected with 
a gene for an activating enzyme, is challenged with the 
prodrug or its toxic metabolites either before or after 
transfection. For several reasons, these values may not 
accurately reflect the situation in vivo. Additional fac- 
tors, such as pharmacokinetics, prodrug distribution, 
and immune responses complicate the overall picture. 
Moreover, obtained IC50 values may vary for different 
cell types, and not all cells in a tumor may be accurately 
modeled by the cell line chosen for in vitro study. 
Despite these concerns, these parameters provide a 
rational basis for comparing different GDEPT systems 
and should also be helpful in designing new systems. 

Increasing the BE 

The extent of the BE can be determined from the effect 
of the treatment on non-genetically modified cells that 
takes place after prodrug administration, when only a 
fraction of the tumor mass is genetically modified to 
express an activating enzyme (5, 8). The striking suc- 
cesses described in GDEPT would surely not be possi- 
ble in the absence of such an effect. As described, some 
models require only 1-2% of cells to be genetically mod- 
ified to obtain therapeutically significant results. 

Toxic metabolites are formed after prodrug activa- 
tion and may be released from dead and dying geneti- 
cally modified cells. This mechanism is postulated for 
5-FU formed from 5-FC; for the metabolites of CP and 
IP, aldophosphamide, phosphoramidic mustards, and 
acrolein; for benzoic acid mustard released from 
CMDA; and for 6-MeP, formed from the corresponding 
deoxynucleoside. Supporting this model for the BE is 
the observation that cell-to-cell contact is not required 
for the killing of untransfected cells by these agents, 
either in vitro or in vivo. In vitro, 30% of cells express- 
ing CD suffices for the eradication of a whole cell pop- 
ulation by 5-FC. This BE is dramatically greater in vivo, 
even with immunocompromised mice: 2% CD + tumor 
cells can yield 100% tumor regression in athymic mice; 
with 4% CD + cells, 66% of animals are cured of their 
xenografts. When similar experiments are performed in 
immunocompetent animals, the results are better still. 

For purine or pyrimidine nucleosides, the toxic 
metabolites are not diffusible across cell membranes, so 
the HSV-TK/GCV system apparently requires cell-to-cell 
contact, specifically gap junction formation (23), to dis- 
play a BE. Consistent with this model, one report 
showed that tumor cells resistant to BE did not show 
dye transfer from cell to cell, whereas BE-sensitive 
tumor cells did. Furthermore, dieldrin, a drug known to 
decrease gap junction communications, diminished the 
dye transfer and also inhibited the BE, leading to the 
suggestion that the BE of this system could be enhanced 
by pharmacological manipulation of the gap junctions 
in vivo. Apigenin, a flavonoid, and lovastatin, an 
inhibitor of HMG-CoA reductase, both upregulate gap 
junction function and dye transfer in tumors express- 
ing gap junctions. Touraine et al. (24) studied the con- 
trol of tumors grown from a mixture of 10% HSV-TK + 
adenocarcinoma cells and 90% TEC cells using GCV. In 
the absence of lovastatin or apigenin, 30% of animals 



treated become tumor-free, but when tumor-bearing 
mice were administered lovastatin or apigenin during 
GCV treatment, their antitumor response rate doubled. 

Although these results are consistent with the hypoth- 
esis that gap junctions mediate the BE after GCV treat- 
ment, other data suggest that additional mechanisms 
are involved. In one study of human lung tumor cell 
lines of different origins, significant cell killing occurred 
when only 10% of cells expressed HSV-TK. In this sys- 
tem, gap junction communication was not apparent 
from measuring the rapid intercellular transport of 
Lucifer Yellow, which detects "rapid-transfer" gap junc- 
tional communications, although it could be seen by 
the slow transfer of a different dye, calcein-AM, which 
measures the "slow-transfer" gap junctions. However, 
neither an inhibitor (1-octanol) nor an enhancer (all- 
trans retinoic acid) of gap junction communication 
affected the extent of the BE, suggesting either that low 
levels of gap junctions can produce a maximal BE or 
that bystander cell killing occurs by other means (25). 
Boucher et al. compared the efficacy of the HSV- 
TK/GCV system in two human carcinoma cell lines 
after exposure to GCV and found that the BE depended 
on the concentration of the enzyme, the number of cells 
expressing HSV-TK, and the overall confluence of the 
cells, but not on the activity of functional gap junctions, 
as assessed by the Lucifer Yellow assay (26). 

Another suggestion is that the TK enzyme is trans- 
ported by apoptotic vesicles or through gap junctions. 
Phagocytosis of material (e.g., hydrolases or other lytic 
enzymes) from dying TK + cells by bystander cells has 
also been suggested as a mechanism for the BE. Apop- 
tosis was detected in bystander cells and it was found 
that this event could be inhibited by BCL2 expression. 
However, during the apoptosis induction period in 
bystander cells cocultured with HSV-TK-expressing 
cells, no phagocytosis was observed. It has also been 
suggested that killing of tumor cells by apoptosis could 
heighten the immune response to wild-type tumor cells 
by a priming effect. 

A quantitative expression of the BE was recently pro- 
posed using the NR/CB1954 system in a range of 
human tumor-cell types. The IC 5 o's of non-NR-express- 
ing cells were measured in the presence of varying pro- 
portions of NR-expressing cells. The shift in IC S0 was 
used to calculate a value for the BE, termed the trans- 
mission efficiency (TE), which is the decrease in IC S0 
caused by the BE, expressed as a percentage of the max- 
imum measured decrease. The percentage of NR- 
expressing cells for which the TE was 50% (TE S0 ) is a 
single data point for the BE. The TE 50 in the cell lines 
ranged from 0.3% to approximately 2% (27). 

There were early suggestions that the immune 
response improves the efficacy of GDEPT. Although 
the BE has been observed in immunocompromised 
animals, recent findings suggest that the BE in vivo is 
mediated largely through the release of cytokines. 
Ramesh et al. (28) report that GCV treatment of carci- 
nomas that contained a mixture of HSV-TK + cells 
resulted in almost total tumor ablation in immuno- 
competent mice, but not in athymic animals of the 
same strain. In a similar experiment, when HSV-TK was 
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transfected into cells grown as xenografts, the tumor 
growth was inhibited for up to SO days in GCV-treat- 
ed, immunocompromised nude mice, but failed to 
eliminate all the tumor cells in these animals, and 
tumors regrew 40-50 days after implantation. By con- 
trast, immunocompetent BALB/c mice developed 
long-lasting immunity in response to HSV-TK trans- 
duction followed by GCV treatment (29). Taken 
together, these studies strongly suggest that an intact 
immune system is important for long-term tumor 
suppression with TK in vivo. 

IL-2 appears to be critical for immune-mediated 
tumor suppression in this system. In one experiment, 
cells grown as xenografts in syngeneic mice were 
injected with an adenoviral vector containing the HSV 
TK gene, the 112 gene, or both, followed by treatment 
with GCV (30). Whereas the tumors continued to 
grow in the animals injected with a control vector or 
the vector carrying IL2, those treated with HSV TK, 
with or without coadministration of IL2, exhibited 
tumor necrosis and regression. However, only animals 
treated with both genes developed effective systemic 
antitumoral immunity against tumorigenic rechal- 
lenges. The antitumoral immunity was associated 
with the presence of tumor-specific cytolytic CD8 + T 
lymphocytes. A third vector containing the mouse 
GMCSF gene enhanced and prolonged this antitu- 
moral immunity, allowing animals treated with all 
three genes to survive for longer than four months 
without recurrence. These and similar findings (30) 
establish the synergism between suicide gene and 
cytokine gene therapies. 

Future perspectives 

Some hurdles must be overcome before GDEPT will 
become a clinically efficient treatment of cancers. Major 
improvements are needed in vector design to enhance 
targeting and delivery of suicide genes. Multiple options 
are available, including nonviral vectors, more complex 
systems involving coexpression of suicide genes with 
immunological or tumor-suppressor genes, and selec- 
tively replicating viruses. Double suicide gene therapy, 
in which a combination of suicide genes is introduced 
simultaneously, shows promise in vitro and in vivo. The 
released active drugs in such an approach can act by dif- 
ferent mechanisms, leading to a synergistic effect on 
tumor cells or an enhanced BE, particularly if cell-per- 
meant and cell-impermeant active metabolites can be 
released together. Additionally, the occurrence of resist- 
ant populations is less likely for drugs with different 
mechanisms of action. Uckert et al. (31) have shown for 
human carcinoma cell lines grown in vivo that double 
suicide gene therapy (involving HSV-TK and CD) 
allowed the elimination of tumors, but neither gene 
applied individually gave this result. 

Thus, GDEPT systems have already shown efficacy 
in vivo. Future developments in this technology 
should use mutagenesis to obtain more efficient acti- 
vation of a given prodrug, or to adapt the active site 
so that it binds better to prodrugs that are not sub- 
strates for endogenous enzymes. The prodrugs, too, 
should be redesigned to create better substrates for 



the enzymes, to maximize drug release or the BE, to 
take advantage of self-immolative strategies of acti- 
vation, or to allow the active drug to accumulate more 
readily in tumor cells. Finally, it will also be useful to 
investigate the ways in which different prodrug sys- 
tems synergize with each other or with other cancer 
treatments. The combination of GDEPT with radio- 
therapy or immunotherapy has previously been sug- 
gested. Such approaches may involve a sequential 
treatment schedule (GDEPT/radiation therapy or 
GDEPT/immunotherapy). The transfection of suicide 
genes together with genes that are able to increase the 
sensitivity of the tumors to radiation or enhance the 
potential of the host immune system is an alternative 
strategy. Other combination therapies are possible, 
such as applying GDEPT in conjunction with repli- 
cating oncolytic adenoviruses, such as ONYX-015 
(32; see also Heise and Km, this Perspective series, ref. 
33) or ONYX-838, or the use of such viruses as carri- 
ers for the suicide genes (see Hermiston, this Perspec- 
tive series, ref. 4). The combination of these replicat- 
ing adenoviruses with conventional chemotherapy 
has proven highly effective, and replacing the 
chemotherapeutic arm in the ONYX-015 study with 
GDEPT might well provide additional benefits. 
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